CHAPTER I
INTRODUCTION 

Statistics affects everybody, and touches life at many 
points. As citizens we help to provide statistical 
information — our very entry into the world and exit 
from it are recorded statistically— and propagandists 
daily try to convince us of something, or even to fool 
us, by means of statistical facts and arguments. The 
running of the community through its institutions of 
government and business depends very much on 
statistical information, and this dependence increases 
as business tends to become concentrated in larger 
concerns and the government intervenes more and 
more to plan our economic and social life. 

The propagandists, administrators, and business 
executives who use (and misuse) statistics are fairly
numerous; and to them may be added such people as 
politicians, social students, and social reformers who 
employ statistical facts and methods to provide a basis 
for policy. Such facts and methods also have an 
important place in the development of sociology and 
economics as sciences; the methods are very important
to experimentalists in most branches of biology, and
are used by workers in the more exact sciences of
physics, chemistry, and engineering. Statistical ideas
are at the root of many current theories in biology,
physics, and chemistry: indeed, a statistical approach 
is probably one of the most characteristic features of 
modern science. Finally, statistics as a subject is 
naturally a major interest to the comparatively small
body of professional statisticians.

As a result of the many approaches to the subject,
the word statistics and its associated words statistical 
and statistician have various meanings. First we have 
the dictionary definitions, in which statistics refers in 
the singular to the subject as a whole, and in the plural 
to numerical data. I shall adopt both usages. To the 
‘man in the street’ statistics are just figures, and he is
inclined to think of the statistician as being primarily 
one who counts the numbers of things. To the 
economist, used to the qualitative ideas of economic 
theory, statistical is almost synonymous with quanti- 
tative. To the physicist, statistical is the opposite of 
individualistic or exact, since to him statistics is a 
subject that deals, above all things, with groups and 
probabilities rather than with simple entities and 
certainties. To the experimental scientist who is used 
to gaining knowledge by conducting experiments under 
controlled conditions, statistical methods are those 
which are employed when accurate experimental 
control is impracticable or impossible. The field of 
application of statistics is mostly (but by no means 
entirely) economic, and so the statistician is sometimes 
thought of as a kind of economist. On the other hand, 
statistical methods are basically mathematical, and 
many people think of a statistician as something of a
mathematician. One might almost say that the mathe- 
matician accepts the statistician as an economist, and 
the economist accepts him as a mathematician. Some 
cynics think statistical methods so uncritical that one 
can ‘prove’ anything by them; and others think they
are so critical that they can prove nothing. At the
other extreme are those enthusiasts who think that as
a means of increasing knowledge the power of statistics
is boundless and almost magical. These views are
justifiable but incomplete; and the purpose of this
book is to give a complete and (as far as I can make it)
balanced view of the whole subject. 

There are several general considerations which may 
profitably be borne in mind when approaching the 
subject of statistics. 

First, it is both a science and an art. It is a science 
in that its methods are basically systematic and have 
general application; and an art in that their successful 
application depends to a considerable degree on the 
skill and special experience of the statistician, and on 
his knowledge of the field of application, e.g. economics. 
Statistical methods are not a kind of automatic machine 
into which numbers can be put and from which 
perfect results can be taken. Nevertheless, the subject 
is not a closed mystery, and I believe that it is not 
necessary to be a statistician to appreciate the general 
principles underlying it. 

As a science, the statistical method is a part of the 
general scientific method, and is based on the same 
fundamental ideas and processes. This point will 
frequently come up in this book, and suggests one
reason why the study of statistics is good educationally. 
It teaches the scientific method in terms of things of 
everyday experience, and inculcates a habit of scientific 
approach to ordinary economic, social, and political 
problems. It will be seen, however, that statistical 
methods have their own special features. These arise 
from the fact that the data are not simple, like those 
that usually result from a well-designed and well- 
controlled scientific experiment, but are relatively 
complex, being the result of a number of causes all 
operating together without control. Statistics deals 
with figures that are subject to uncontrolled variation. 

Another feature that statistics has in common with
other scientific subjects is that it is not finished and 
complete; it is always developing. Despite its power 
and essential usefulness, it has limitations and imperfections;
but future developments will undoubtedly
reduce these. 

The scope of the subjects included under statistics 
is wide; and few, if any, statisticians are expert in all 
branches. Some specialize in the development of the 
mathematical theory underlying statistical methods,
and are essentially mathematicians. Others are
interested in the methods themselves, both elementary 
and advanced, and in their general application to 
almost any field, although they often have also some 
special experience of one field. There are also statisticians
who are able to use with confidence only
elementary methods (perhaps fairly simple tables,
diagrams, and averages) but who have a very wide and
deep knowledge of some field of application. As will 
be seen later, knowledge of this kind is very important, 
and its use makes the work of such a statistician much 
more than the mere clerical work of tabulation that it 
sometimes seems to be. Statisticians in this category 
specialize; and often are as much economists or 
sociologists, say, as statisticians. One who is an 
expert in trade statistics may or may not know much 
about the statistics of public finance, but he will 
probably know very little of vital statistics; and an 
expert in vital statistics who wishes to deal with agri- 
cultural statistics, say, may have much to learn and 
much experience to gain. 



CHAPTER II 

THE RAW MATERIAL 

The conception of statistics as having to do with 
figures is the most popular one for very good reasons. 
Many of the questions that are the subject of common 
conversation and controversy require numerical data 
for their resolution. Trains are more (or less) crowded 
than buses; the English are better (or worse) patrons 
of sport than of the arts; women drivers are more (or
less) competent than men drivers; and so on. These
are the kinds of questions that are argued in newspaper 
columns, drawing-rooms, common-rooms, and public-houses.
The disputants give their various experiences,
both relevant and irrelevant; one always [...]
train and never has to stand; another once had to
stand on a train journey but has never stood in the
bus; another has his own motor-car and does not use
buses or trains, but knows that road transport gives a
better (or worse) goods service than the railways; and
so the discussion proceeds, and probably leaves everyone
at the end with the same opinion as that with
which he started. But if someone [...]
really reliable and cogent numerical [...]
on a wide experience, the question is settled and the
discussion ‘peters out’. Life would be [...]
were debarred from discussing questions of fact we
do not fully understand, and the ‘know-all’ who
regularly ruins such discussions with his facts is little
more than a bore. But we cannot afford to trifle with
important subjects by ill-informed controversy.

General impressions are entirely [...]
some facts and events strike the imagination more than
others and are more easily remembered. For example,
if we have a theory that the weather changes with the 
rising and setting of the moon, we notice the occasions 
when our theory is borne out and forget those when 
it is not; and it is only by taking systematic records
that we can arrive at the truth of the matter. Even 
when a general impression is qualitatively correct, it 
may be quantitatively incorrect. If the morning train 
to town is occasionally late, we are apt to feel that it is 
more often late than not, whereas an actual count 
might show that it is late on only one morning in ten, 
on the average. The following instance of a general 
impression being corrected by numerical data exem- 
plifies a common occurrence. A few years ago the 
L.M.S. railway company analysed their passenger 
statistics to show how many journeys of various lengths 
had been made, and the late Mr. Ashton Davies 
reported: ‘A great deal of valuable and interesting
information came to light, much of it contrary to the 
then current opinion. For example, it was quite a 
common belief that the railways had lost most of their 
former short-distance passenger business, owing to the 
competition of road transport. The dissection, how- 
ever, revealed that quite a significant proportion of 
the passenger business of the L.M.S. Company still
consisted of really short-distance traffic.’ 

We shall see in later chapters that statistical methods
are applied to the results of physical, chemical, and 
biological experiments and observations, as well as to 
results obtained in social and economic investigations. 
The making of observations is usually a major part of
a research in an experimental science, and instruction
in experimental technique and in the handling of the
necessary apparatus forms a large part of the training
of the physicist, chemist, and biologist. This is no
place for a discussion on such a subject, which belongs 
properly to books that specialize on the sciences 
concerned. In social and economic research, on the 
other hand, the collection of the material requires 
little apparatus beyond a pen or pencil and paper, and 
no experimental technique; and possibly because of 
this the subject does not always get the attention it 
requires. Considerable knowledge and experience are
necessary to know where to go for statistical material, 
how to ensure its accuracy, and how to interpret its 
meaning; and since these are matters that are regarded 
as falling within the scope of statistics, we must 
consider them at some length. 

In order to obtain statistical material we may either
go to the records of some public body that collects and 
publishes statistics as a routine, or make a special 
survey. 

The most important routine collectors and suppliers 
of statistics are governments. The systematic record- 
ing of trade statistics by the English Government 
started with the appointment of an Inspector General 
of Exports and Imports in 1696; the first British 
census of the population was held in 1801. Since the 
early years of the nineteenth century the volume and 
scope of British official statistics have increased enor- 
mously and continuously right up to the present day. 

The statistical publications of the British Govern- 
ment are indexed and briefly described in a Guide to 
Current Official Statistics, published annually by the 
Stationery Office. There are listed in this Guide 
some 500 reports, issued by every department of state, 
and covering a wide range of the nation’s life and 
activities. The most important and well-known
subjects are finance, trade and industrial production, 
employment and unemployment, prices, health and 
mortality, and population; but there are also more 
romantic subjects, such as reports of coroners’ pro- 
ceedings on treasure trove and statistics of smuggling 
seizures.


Government statistics were originally required for 
administrative purposes, and their publication to and 
use by independent investigators is in some degree a 
by-product. Nevertheless, some government depart- 
ments, at least, take seriously their function as 
collectors and suppliers of statistics for general use 
and it is well understood that one of the chief values
of published reports is to provide material for independent
investigators. The very existence of the
Guide suggests this. Some government officials who
deal with statistics are also Fellows of the Royal
Statistical Society, where they have contact with
statisticians outside the government service. These
[...] suggestions and criticisms
of statisticians and to be somewhat influenced
by them in deciding the form and content of published
[...] his bona [...]
published statistical details.

Much good can be said of British official statistics
but considerable improvements are possible. There
[...]
taken decennially is too infrequent and [...]
demand for a quinquennial census. Statisticians and
business men both want a Census of Distribution to 
give information, now almost entirely lacking, about 
the distributive industry. 

A second complaint is that too long an interval sometimes
elapses between the collection of material and its
publication. For example, the final report of the
Census of Production of 1930 was published in 1935. 
A third complaint is that statistics about similar or 
related subjects, collected by different government de- 
partments, are not co-ordinated, so that they often 
differ in scope and definition. This makes it im- 
possible to use these data for making comparisons and 
considerably reduces their usefulness. The volume 
and cost of official statistics are so great, their subjects
so comprehensive, the agencies that use them so many, 
and the purposes for which they are used so varied, 
that there seems to be an unanswerable case for a 
central department to secure the efficient collection and 
publication of official statistics and avoid inconsist- 
encies, overlapping, and waste. However, the case has 
not yet been conceded by the British Government. 

The governments of other countries also publish 
official statistics, varying in comprehensiveness and 
reliability, and the League of Nations is responsible for 
a considerable amount of important statistical work. 

In Great Britain, many semi-public bodies such as 
the Bank of England, the stock and produce exchanges, 
and trading associations, regularly publish statistical 
material, mostly of a commercial character; and much 
of this is reproduced in financial journals and in the 
‘city’ columns of daily newspapers. 

Statistical information is given in a ‘potted’ form
in a variety of year-books; for example, the Statistical 
Abstract for the United Kingdom, published annually
by the Stationery Office, contains about 400 pages of 
tables summarizing British official statistics. Such 
summaries have their uses, but also their dangers, for 
the tables are given with very little explanation and 
there is always a risk that the reader may misinterpret 
the figures. In addition, statistical and economic 
journals, notably the Journal of the Royal Statistical 
Society, contain many critically prepared digests of 
statistical information concerning a variety of subjects, 
and although these do not rank as primary sources,
[...] inexpert with greater
confidence than the original unedited figures.

Special surveys for obtaining statistical information 
are made by governments, unofficial bodies, and
private individuals; and in this field the last two have 
led the way. Indeed the chief function of unofficial 
surveys has been to supply the deficiencies of official 
statistics. The Manchester Statistical Society and the 
Statistical Society of London (now the Royal Statistical 
Society) were founded in 1833 and 1834 with declared
objects that included the collection of statistics
illustrative of the condition of society and in the
early years of these societies this formed a fair proportion
of their activities. In 1886, Charles Booth, a
London shipowner and merchant, started his famous
and very extensive survey of the conditions of life of
the people of London. Because of its comprehensiveness
and its statistical character this is regarded as a
pioneer work. It has been followed by a very large
number of social surveys in different parts of this
[...] U.S.A. I think it is true to say that until recently
most non-official statistical activity has been concerned
with the social condition of the people. Special 
statistical inquiries are also made in the commercial 
and political worlds. 

The complicated analysis to which statistical data are 
often subjected, and the highly condensed form in 
which they are summarized as averages, give the final 
results of an investigation a form and order that often are 
not obvious in the original figures, and an appearance 
of accuracy and precision they do not necessarily 
possess. In contemplating the finished work it is all 
too easy to forget the raw material from which it is 
made. Nevertheless, no statistical results can be 
reached that are not already implicit in the data, and
the accuracy of the former depends on that of the
latter. It therefore behoves anyone who uses statistics
to exercise care in obtaining them, and if he is going
to use those already published, to examine them 
carefully for errors and to understand their exact 
meaning. 

In order to do this effectively much knowledge is 
needed of the way in which the figures are collected, 
of the circumstances surrounding the facts recorded 
and of the kinds of errors that can arise. As a check 
on accuracy, the results of one inquiry can often be 
compared with those of another to see that they are 
reasonably consistent, and the data can also be tested 
for internal consistency. For example, when vital 
statistics were first collected in some colonial depen- 
dencies in Africa, some twenty years ago, it was found 
in one instance that more children died under the age 
of one year than were born. That is an extreme 
example of internal inconsistency exposing error! The
following quotation from Mr. B. Seebohm Rowntree’s
1941 report on his social survey of York gives a
picture of the care and attention which are devoted 
to this business of the collection of reliable statistics: 

‘Obviously, in making a house-to-house inquiry everything
depends upon the skill, tact, and reliability of the
investigators. It took some time to discover just the 
right people, but eventually seven were found, five 
women and two men, on whose work full reliance 
could be placed. . . . Moreover, a number of “check” 
visits were paid, at random, or to cases that seemed 
abnormal, and in that way the accuracy of the returns 
was tested and verified.’ 
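
The test for internal consistency mentioned before the quotation can be sketched in a few lines of code. The following Python fragment is only an illustration under stated assumptions: the field names (`district`, `births`, `infant_deaths`) and all the figures are invented, and a return flagged in this way calls for scrutiny rather than automatic rejection.

```python
# Illustrative check only: the field names and figures are invented,
# not taken from any actual return.
def inconsistent_districts(returns):
    """Return the names of districts whose recorded deaths under one year
    exceed their recorded births for the same period."""
    return [
        r["district"]
        for r in returns
        if r["infant_deaths"] > r["births"]
    ]


returns = [
    {"district": "A", "births": 1200, "infant_deaths": 150},
    {"district": "B", "births": 300, "infant_deaths": 340},  # implausible as recorded
    {"district": "C", "births": 800, "infant_deaths": 90},
]

suspect = inconsistent_districts(returns)
print(suspect)
```

A check of this kind can expose an error but cannot prove its absence; figures that pass it may still be wrong.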

In the following paragraphs I give a few examples of 
the difficulties and pitfalls that exist in the collection 
and interpretation of statistical data. 

It is obviously foolish to place reliance on figures 
that are palpably false. A woman’s statement of her 
age is proverbially unreliable, and the information 
gained by asking questions on matters that are ill-defined,
or matters of opinion, depends somewhat on
the way the questions are framed. Market investigators
have found, when investigating the reasons why
people buy particular brands of goods, that the direct 
question is not likely to give the true reason, as people 
do not all indulge in honest self-examination, and those 
who do may not be honest with the investigator. 
Indeed, the person questioned may not have thought 
about the subject before, so that the result of the 
inquiry is influenced by the inquiry itself. This must 
often happen in psychological investigations. Reliable 
data on household expenditure are always difficult to
obtain— how many housekeepers keep even approxi- 
mate records of the way in which they distribute their 
expenditure?
It may have been noticed by some readers that
government officials in collecting returns often show
what seems to be almost a passion for putting people
into classes, unless the data can be given in a well-defined
form such as age or place of birth. When
registering for military service, for example, each man
has a classification number describing his occupation,
and however unique that occupation may be it must
be fitted into a class. This is because classification is
a fundamental part of the statistical method, as we
shall see later, but it is done by the official ‘on the
spot’ because only there is the complete information
available which enables an accurate assignment to the
appropriate class to be made. Difficulties often arise
because of ‘border line cases’. In the Census of 1931
householders were required to give the industry in
which the members of their household were employed
[...]

Uniformity and accuracy can be attained in such
[...] only if very full and precise definitions are
[...]
what to [...] in this book were not precise, and there
was a tendency to include actual thefts; it was feared
that this tendency varied from one district to another 
and from one time to another, thus destroying the 
value of the returns. When, in 1932, this book was 
abolished, and the police had to make up their minds 
whether or not the goods were stolen, the number of 
recorded indictable offences in the Metropolitan
Police area (not necessarily the number of crimes or
indictments) rose from 26,000 in 1931 to 83,000 in
1932.

The reliability of statistical observations depends
very much on the way in which they are made as well
as on the ease with which the required information can
be given. Much information is gathered from returns
and questionnaires completed by people who are not
interested in statistics (citizens, taxpayers, business
men, farmers, factory managers, public officials, and
so on), and it must be recognized that people do not
like filling in forms. The farmer is interested in
growing and selling crops, and he regards the making
of statistical returns as a pestiferous waste of time;
even a statistician would probably be impatient if, for
the information of another statistician unknown to him,
he had to interrupt his work on, say, the world trade
in ants’ eggs, to make a return of the number of
man-hours occupied in the investigation. Therefore
it is wise not to rely too much on the conscientiousness
of people in completing returns; and to remember that
the results are likely to be reasonably reliable only if
the questions are few, straightforward, and easy to
answer.

Where the required information is complicated or 
difficult, enumerators or ‘field-workers’ are usually 
employed. The enumerators employed in making the 
Census have very full instructions as to how the census
forms should be filled up, and experienced field- 
workers such as usually conduct social surveys know 
the snags and are not likely to be misled, so that the 
figures they obtain are usually fairly accurate. 

Although data made up of defined measurements are 
preferable, the statistician often has to deal with vague 
quantities that are matters of personal judgement, such 
as general health and intelligence. To give estimates 
of such things any value at all, the observer must be
specially careful to standardize and define the basis of 
his judgements as far as possible, so that he can obtain 
consistent results that may at least be valid for making 
comparisons. One important stage in doing this is to 
divide the quantity into a number of parts, and to give 
points for the parts separately, adding the points to 
obtain the final result. School examinations provide 
an example. The candidate answers a number of 
questions, each of which is marked separately; and 
some examiners even subdivide the marks for a 
question, giving so many for the correctness of the
answer, so many for the correctness of the method 
by which the answer is obtained, so many for the 
orderly presentation of the argument, and so on. 
Recent investigations have shown that examinations 
do not measure attainment with great exactness — a 
result which shows that even when considerable care 
is taken, it is difficult to make reliable data that have 
a subjective basis. 
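
The device of dividing a judgement into parts and adding the points can be sketched as follows. This is a minimal illustration, not a recommended marking scheme: the part names and the maximum marks in `MAX_MARKS` are assumptions invented for the example.

```python
# Hypothetical marking scheme: the part names and maxima are assumptions.
MAX_MARKS = {"answer": 5, "method": 3, "presentation": 2}


def total_marks(awarded):
    """Add the marks given for each part, checking each against its maximum."""
    for part, marks in awarded.items():
        if part not in MAX_MARKS:
            raise ValueError(f"unknown part: {part}")
        if not 0 <= marks <= MAX_MARKS[part]:
            raise ValueError(f"marks for {part} out of range: {marks}")
    return sum(awarded.values())


question_1 = {"answer": 4, "method": 3, "presentation": 1}
print(total_marks(question_1))
```

Fixing the parts and their maxima beforehand is what standardizes the basis of judgement; the marks within each part remain a matter of the examiner's opinion.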

However accurate and self-consistent statistical 
results may be, they cannot be used safely unless 
everything is known of the way in which they were 
obtained and of the real meaning behind the figures.
It is not often that all the detail surrounding any body
of information is published with the figures, and so
things are not always what they seem. For example,
British criminal statistics give, not the numbers of
crimes committed, but the numbers reported to the
police; and the two are very different. The proportion
of crimes that become known to the police varies
with the kind of crime and from time to time, according
to the changing attitude of public opinion to the various
crimes; and the statistics give very misleading impressions
of the amount of crime extant.

I have already stated that some of the categories 
used in describing data may have to be defined arbi- 
trarily. It is necessary to be aware of differences that 
exist between departments of one government, and 
between countries, in the definitions they adopt for 
what is nominally the same quantity. Those who use 
statistics in the form of series extending over some 
time have also to be on their guard lest some change 
in definition or other basis should break the con- 
tinuity of the series. For example, the figures pub- 
lished periodically of the numbers of the unemployed 
include only insured workers registered at the exchanges
on certain dates, and these are affected from
time to time by legislative changes in the classes of 
workers who may be insured (e.g. in the age limits) 
and in the qualifications for unemployment benefit. 
Statistics of causes of death extending over long 
periods of time are apt to be affected by changes in 
medical knowledge and (dare a layman suggest?) 
fashion causing changes in diagnosis. So important 
is continuity in recorded statistics that statisticians 
almost prefer an existing unsatisfactory basis to be 
maintained rather than suffer changes that may in 
many ways be improvements; and they are very insistent
that when changes are made, two sets of figures
should be obtained for some time, one on the old basis 
and the other on the new, so that the old series can be 
joined to the new. 
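
One way in which an overlapping period allows two series to be joined may be sketched in code. The linking rule used here, rescaling the old figures by the average ratio of new to old over the overlap, is an assumption chosen for illustration (the text does not prescribe a rule), and the figures are invented.

```python
def splice(old, new):
    """Join an old-basis series to a new-basis series using their overlap.

    old, new: dicts mapping year -> figure. Years present in both form the
    overlap. Old figures are rescaled by the average ratio new/old over the
    overlap (an assumed linking rule, one of several possible).
    """
    overlap = sorted(set(old) & set(new))
    if not overlap:
        raise ValueError("series cannot be joined without an overlap")
    ratio = sum(new[y] / old[y] for y in overlap) / len(overlap)
    # Years recorded only on the old basis are converted to the new basis.
    joined = {y: old[y] * ratio for y in old if y not in new}
    joined.update(new)
    return joined


old_basis = {1930: 100.0, 1931: 110.0, 1932: 120.0}
new_basis = {1932: 150.0, 1933: 160.0}
joined = splice(old_basis, new_basis)
print(joined)
```

Without at least one period recorded on both bases the function refuses to join the series, which is the point of the statisticians' insistence on a double set of figures.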

The intelligent interpretation of final statistical 
results often requires a knowledge of unrecorded 
circumstances surrounding the events recorded. For 
example, in an investigation on accidents in naval 
dockyards it was found that the recorded accident rate 
among apprentices tended to decrease year by year 
through their apprenticeship, whereas that among 
naval artificers tended to increase, although both 
groups were doing similar work. The explanation of 
this difference is that the apprentices worked under 
industrial conditions and lost ‘time’ and money when 
away from work because of an accident; the artificers 
worked under service conditions and did not suffer 
this loss. 

The method of inquiry by sample, which is much 
used in social work, has its own special difficulties and 
sources of error; that method will be dealt with in 
Chapter VI. 

To give readers a more concrete and integrated 
picture of what is involved in the collection of data 
for a statistical inquiry, I am going here to give some 
detailed comments on an actual example. The Ministry 
of Transport has published three Reports on Road 
Accidents occurring in Great Britain in the years 1933,
1935, and 1937, and these contain statistical summaries 
of a number of details of the accidents; the inquiries 
that provided the information for these reports are 
the example. In commenting on the collection of the 
data, I know nothing of what went on ‘behind the
scenes’, but some information is given in the Reports
and the rest has been surmised. 

Presumably the Ministry hoped that from a statis- 
tical summary of the circumstances surrounding road 
accidents, something would be learnt of the causes, 
and this purpose, as well as administrative and practical 
considerations, would be borne in mind when deciding 
what data to collect. The police in various parts of 
the country reported the details of the accidents and 
it was therefore necessary to ask only for such information
as such a scattered body of men could give
reliably and uniformly. In the earlier Reports,
estimates were given of the speeds of the vehicles just
prior to the accident, but such estimates were admittedly
unreliable and were not given in the 1937
Report. There are also other details that would have
been very useful but in the circumstances had to be
omitted. Thus, the previous accident and medical
history of the drivers would have helped to determine
whether medical or psychological fitness had anything
to do with the tendency of a driver to become involved 
in accidents. 

Since it is not to be expected that the police would 
fill in the necessary questionnaires with as much zeal 
as they give to their ordinary duties, it was desirable 
to limit the number of questions and not to ask for 
unnecessary details. The statistician who organized
the inquiry had to know enough about traffic conditions
to decide which questions were important, and
which could be omitted as having little or no im- 
portance (e.g. the colour of the vehicle or of the 
upholstery). 

Each accident was reported on a form, thus en- 
suring that no details were overlooked and the data 
were collected uniformly and systematically by the
various police officers. Some of the details were 
definite and required little explanation, e.g. the date, 
place, and time of the accident and the number of 
persons killed and injured, and so on. Others, such 
as the cause of the accident, needed careful definition. 
The police were not asked to record the cause in their 
own words but sixty-four possible causes were listed 
and fully described, and the policeman who reported 
the accident stated which one of these operated at the 
accident in question. Thus a basis was provided for 
grouping the accidents according to cause. Those 
who described these causes needed a considerable 
knowledge of road traffic to ensure that the list was 
exhaustive. Another piece of information asked for 
was the extent of the injuries to injured persons,
whether fatal, serious, or slight, and elaborate in- 
structions were also given for defining this. 

The development of the details of an investigation 
of this kind requires much thought and planning, and 
only a superman would be able to decide the best 
methods straight away. It is quite clear, from the
changes between the 1933 and the 1937 inquiries in
the information sought, that the Ministry officials
gained experience as they went along; the 1937
questionnaire and method of inquiry, which are given
and described in the corresponding Report, are a
result of this experience, and are a good example of the 
first stage of a well-arranged statistical investigation. 

Accuracy and reliability in the data are important 
because their lack cannot be supplied by elaboration 
and care in the subsequent statistical treatment. 
However, as this is an imperfect world, most data are
imperfect in some degree and many are very imperfect.
Nevertheless, they usually contain some information
and so are far from valueless; and it is the statistician’s
job to make the best he can of them. It is a mistake 
to think, as some do, that inaccurate or unreliable 
figures should not be given careful treatment ; they 
may not merit it, but they certainly need it. For 
example, extra care is necessary to allow for the
inaccuracies and avoid arriving at false conclusions.
Thus, the statistician will first do all he can to obtain 
data that are as precise as possible, and will then apply 
his methods of analysis to make the best possible use 
of the figures he obtains. 


CHAPTER III

ARRANGING AND PRESENTING THE MATERIAL

The results of the first stage of a statistical inquiry 
are sometimes a few fairly simple figures which can 
easily be presented and understood without any special 
treatment ; but more often there is an overwhelming 
mass of data and detail. The first task of the statistician
is to reduce these in the two senses of (a) making
less the amount of detail and (b) bringing the data into
a form whereby the significant features stand out
prominently. The statistician must get out of the
situation in which he cannot see the wood for the trees.
It is easy to state in general terms how this is done:
the unimportant details are decided upon, and the
data arranged so as to suppress these and leave the
important features clearly expressed. The process is
essentially one of summarizing. 

In fact, this is already started when the field and 
scope of the inquiry are chosen. With the whole 
universe before him, the investigator chooses the 
subject of, say, the housing conditions of one city at 
one time, and he ignores as irrelevant everything 
except a few particular facts for the city, such as the 
numbers and sizes of houses, their distribution, the 
numbers and composition of the families living in 
them, and perhaps the incomes of the families. It is 
not supposed, of course, that the ignored facts are 
absolutely unimportant, indeed they may afterwards 
have to be considered even in relation to the original 
subject of the investigation. The state of housing in 
a city may, for instance, later be related to unemploy- 
ment or to other things that are omitted from the 
original inquiry because it is impossible to deal with 
everything at once. But some selection must be 
made, although this may depend partly on such 
accidents as the investigator’s interest. This process 
of isolating some small part of the universe for study 
is common to all scientific investigation. 

The first and most important step in the statistical 
reduction of data is usually to group into one class the 
items that, for the particular purpose in view, need 
not be distinguished. When many items are put into 
several groups in this way they are classified. For 
example, the Statistical Abstract gives the yearly 
values and quantities exported from the United 
Kingdom of over a hundred kinds of articles. To
reduce these figures to manageable proportions, and
obtain a broad picture of the export trade, it may be
sufficient to use the Board of Trade classification into
the three broad classes: I. Food, Drink, and Tobacco;
II. Raw Materials and Articles Mainly Unmanufactured;
III. Articles Wholly or Mainly Manufactured.
In this scheme no distinction is made between
biscuits and fish, both of which come in Group I; 
between coal and wool, which are in Group II; or 
between coke, steel, and horses, all of which are in 
Group III. 
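
The grouping just described can be sketched in a few lines of Python. This is only an illustration: the class labels follow the text, but the function name and the export values are invented placeholders, not figures from the Statistical Abstract.

```python
# Board of Trade class for each article named in the text.
ARTICLE_CLASS = {
    "biscuits": "I",
    "fish": "I",
    "coal": "II",
    "wool": "II",
    "coke": "III",
    "steel": "III",
    "horses": "III",
}


def classify_exports(value_by_article):
    """Add up export values within each broad class (hypothetical helper)."""
    totals = {}
    for article, value in value_by_article.items():
        group = ARTICLE_CLASS[article]
        totals[group] = totals.get(group, 0) + value
    return totals


# Invented values in arbitrary units, for illustration only.
example_values = {
    "biscuits": 2, "fish": 3, "coal": 10, "wool": 5,
    "coke": 1, "steel": 20, "horses": 4,
}
class_totals = classify_exports(example_values)
print(class_totals)
```

Seven separate figures are reduced to three, and the distinction between, say, biscuits and fish is deliberately lost.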

Sometimes the subject falls easily and naturally into 
a few categories. Thus, if families are grouped ac- 
cording to the number of children, the categories will 
naturally be 0, 1, 2, 3, etc. children. Frequently,
however, the subject is such that the classes have to
be created more or less arbitrarily, as for the exports
just mentioned. There are three points to be observed
in making such classifications.

First, any given body of results can usually be
classified in many ways, and the best way will depend
on the purposes of the inquiry. If the aim is to relate
changes in export trade to changes in employment, the
exports might be so grouped as to include coal among
manufactured articles, because the coal industry 
employs a lot of labour. For other inquiries it might 
be better to group the articles according to industries 
— fish, coal, iron and steel, textiles, engineering, and 
so on, or according to the amount of shipping space 
required per million pounds’ worth of the article. 
One difficulty in the way of using existing published
figures is that they are not always classified in a way
that is suitable for the particular inquiry. The choice
of the basis of classification is not a matter of statistical
method, but requires special knowledge of the subject
of investigation.

The second point, implicit in the whole idea of
classification, is that all items grouped together should,
for the purposes of the particular inquiry, be sufficiently
alike; that each class should be homogeneous. Table 1
gives a summary of the death rates in England and
Wales in 1938, and in the upper part of the table all
people in any one decade of life are classed together.
The figures given in the lower part of the table show
that the 0-10 years group is far from homogeneous, for
whereas the average death rate for the group is 8.4,
that for the first year of life is 55.1, the rates for the
remaining years being very low. For some studies, the
grouping by decades may suffice; but for a study of
infantile mortality much finer grouping is necessary;
and figures are in fact even given separately for the
first few months of life, although the death rates used
in this connexion are not expressed as in Table 1, but
as deaths per 1,000 live births. The death rates for the
separate years over 80 are also probably far from
uniform, but we are not often interested in studying
mortality at these ages and the variation is therefore
unimportant and may usually be ignored.

Table 1

Deaths during 1938 of Persons of Various Ages per 1,000
Persons of Corresponding Age living at the Mid-Year.
England and Wales

Age, years      Death Rate

  0-10              8.4
 10-20              1.5
 20-30              2.6
 30-40              3.[?]
 40-50              5.7
 50-60             12.7
 60-70             29.2
 70-80             73.3
 80-              173.8

  0-1              55.1
  1-5               4.6
  5-10              1.9

Lack of homogeneity within the groups may always 
be detected by examining the figures, as we have done 
those in Table i, but the investigator can often decide 
from his general knowledge what variation is likely to 
occur, and can adopt a scheme of classification accord- 
ingly. Usually, the variation in the whole material is 
so great and complex that it is impossible to have 
classes that are perfectly uniform. However, the 
statistician must classify, and so some variation within 
classes must usually be tolerated. Skill and experience
are necessary to steer safely between the Scylla of
having too few broad classes, with much variation
within each, and the Charybdis of having too many
fine classes, each with very little variation.

Consideration of the third general point about 
classification modifies in some degree the application of
the second. Consider Table 2. According to the full
classification given there it appears that pedestrians
are the ‘villains of the piece’ in causing road accidents;
of all the classes, they caused most accidents. But it
is unfair to class all pedestrians together and to separate
the various kinds of drivers. When the drivers are
classed together it is seen that they caused 37.6
per cent. of the accidents and were almost as culpable
as pedestrians. Thus we see that false impressions
may be created if the classes are not of the same rank,
so to speak. It is impossible to say what is the correct
grouping in such instances (probably none is absolutely
correct), but we should realise that the grouping
can affect the impression created by the data.

Table 2

Numbers of Fatal Road Accidents in Great Britain caused
by Various Classes of Road Users, 1936-37

                                             Accidents
Class of Road User                       Number   Percentage

Drivers of private motor-vehicles,
  except motor-cycles                       868       14.8
Drivers of motor-cycles                     [?]        [?]
Drivers of public conveyances               333        5.7
Drivers of vans, lorries, etc.            40[?]        [?]
Drivers of other vehicles, except
  pedal cyclists                            [?]        0.7
    (all drivers together)                            37.6
Pedal cyclists                          1,05[?]       17.9
Pedestrians                               2,470       42.2
Other persons                               133        2.3

Total                                     5,855      100.0


The most usual way of presenting statistical informa- 
tion is in a table of figures. Early in the nineteenth 
century there was some controversy between those who 
preferred to present results in a literary form and those 
who preferred tables and who were accused of presenting
only the ‘dry bones’. However, the ‘table school’
won the day, and it is perhaps an echo of this controversy
that the original prospectus of the (then) London
Statistical Society declared the aims of the Society ‘to
confine its attention rigorously to facts, and, as far as
it may be found possible, to facts which can be stated
numerically and arranged in tables.’

Statistical data are usually presented in tables, even 
in ordinary newspapers, but some writers still seem to
prefer giving their figures in a more literary form.
The question may be one of taste and training, but I
find the tabular method of expression much clearer.
It does not make the figures any less dry to have them
strung out in sentences and joined by words and 
phrases such as ‘whereas’, ‘on the other hand’, ‘as 
against’, and so on. I am not here criticizing the use 
of words to point out special features or contrasts 
shown by results given in tables. 

There is an art in arranging a table to present data 
economically and clearly, and in a way to facilitate any 
comparisons that the reader may be required to make.
However, the arrangement cannot add to or subtract
from the significance of the figures, and I need not
write anything further on the subject here. 

Diagrams and charts are also much used in present- 
ing statistics, and have a value because even statistical 
ones give some delight to the eye and add a spark of
interest to a paper. Their chief importance, however,
is that they give a picture of the broad statistical facts
that is more readily taken in than a table. It requires
a careful examination of the figures of a table to
appreciate their full significance, and great concentration
of thought is necessary to keep the general picture
in mind while reading the figures in detail. Magnitudes
are more easily appreciated and remembered
when conveyed to the mind by pictures than by
numerical figures. On the other hand, the broad
picture given by a diagram is not as exact in detail
as that given by a table of figures, and since it is
somewhat affected by the way in which the diagram is
made, it may even give a misleading impression. I
am now going to give a few examples to illustrate these
statements and to show how some important types of
diagram are read.

Figure 1

Relation between Milk Consumption and Income

[Pictorial chart. Rows: Income per Person per Week (up to
10s.; 10s. to 15s.; 15s. to 20s.; 20s. to 30s.; 30s. to 45s.;
over 45s.). Symbols: Milk Consumption per Person per Week.
One full bottle represents one half-pint of fresh milk,
one full tin represents one half-pint of condensed milk.]

Figure 1 shows Sir John Orr’s estimates of the
weekly consumption of milk just previous to 1936
by people in six groups classified according to the
average income of the group. Each man, woman, and
child is counted as a person, and the estimated family
income and consumption are divided equally between
the members of the family for the purposes of presenting
the results. This is a common type of simple
diagram and we can see at a glance that (1) the
consumption of fresh milk is greater than that of
condensed milk in all classes, (2) the consumption of
fresh milk increases considerably as income increases,
(3) the consumption of condensed milk decreases
slightly as income increases, and (4) the changes in
total consumption of milk are dominated by the
changes in consumption of fresh milk. 

An alternative and more austere form of diagram is 
obtained by using, instead of the rows of milk bottles, 
long thin rectangles proportional in length to the 
quantity represented. Some serious-minded statis- 
ticians prefer this form of representation and are 
scornful of ‘pictorigrams’. However, there is nothing 
unsound in their use provided they are correctly done, 
and I think there is everything to be said for presenting
data in as attractive and striking a way as possible,
even by using coloured diagrams if they can be
afforded. Sometimes, of course, the subject defies
acceptable pictorial representation; it would require
some ingenuity to represent the death rates of Table 1
or the accident figures of Table 2 by pictures not too
macabre for modern taste. 

The above data of milk consumption may also be 
used to show how the method of representation can 
affect the impression created by the diagram. These 
same data are represented in Figure 2 by six bottles 
and six tins proportional in volume to the relative 
consumptions for the six classes. Figures 1 and 2
present the same set of facts, but the changes in
consumption are much less striking in Figure 2 than
in Figure 1. It is usually better to adopt a method
like that used in Figure 1, where the quantity is
represented essentially by a length, since it is difficult
to appreciate quantities by areas or volumes, particularly
if the latter are inadequately represented on
two-dimensional diagrams.
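
The reason a volume diagram looks less striking can be put as a small piece of arithmetic. The sketch below assumes that each bottle keeps the same shape, so that all its linear dimensions grow by the same factor; the function names are hypothetical and the ratios are illustrative, not Sir John Orr's figures.

```python
def height_factor_when_length_encodes(quantity_ratio):
    """A row of bottles (or a bar) grows in direct proportion to the quantity."""
    return quantity_ratio


def height_factor_when_volume_encodes(quantity_ratio):
    """A single bottle whose volume is proportional to the quantity grows
    only as the cube root of the quantity in each linear dimension."""
    return quantity_ratio ** (1.0 / 3.0)


for ratio in (2.0, 8.0):
    by_length = height_factor_when_length_encodes(ratio)
    by_volume = height_factor_when_volume_encodes(ratio)
    print(f"quantity x{ratio:g}: length x{by_length:.2f}, bottle height x{by_volume:.2f}")
```

Under this assumption a doubled consumption makes the bottle only about a quarter taller, which is why the eye underrates the change.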

The apparent degree of fluctuation of a quantity 
can be made to be almost anything we please by 
choosing the scale of the diagram suitably. This is 
illustrated by the diagrams in Figures 3 and 4. 
Figure 3 presents a statistical history of a social 
phenomenon that for years has been a subject of great 
public concern, viz. unemployment. The monthly 
figures of men unemployed at the times of the counts,
expressed as percentages of numbers of insured
workers, have been taken from the Statistical Abstract,
and are represented graphically. The fluctuations
seem enormous, but, although they are slightly
affected by occasional legislative changes altering the
basis of the figures, they substantially represent the
actual changes in ‘unemployment’. Figure 4 shows
the other side of the same picture and gives the
numbers employed at the times of the counts, expressed
as percentages of the total insured workers. The
fluctuations in Figure 4 seem to be much less than
those in Figure 3, but they are actually the same
fluctuations, reversed in sign (or turned upside down)
and drawn on a different scale. The statistician
chooses his scale according to the impression he thinks 
the figures should convey, and that impression will of 
course depend on the object for which the figures are 
being used. 
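
The relation between the two charts can be checked numerically. In the sketch below the unemployment percentages and the two axis ranges are invented for illustration; they are not the Statistical Abstract figures, and the function names are hypothetical.

```python
def employed_from_unemployed(unemployed_pct):
    """Employed percentage is the complement of the unemployed percentage."""
    return [100.0 - u for u in unemployed_pct]


def swing(values):
    """Difference between the highest and lowest value in the series."""
    return max(values) - min(values)


def share_of_axis(values, axis_min, axis_max):
    """Fraction of the vertical axis that the swing occupies on the chart."""
    return swing(values) / (axis_max - axis_min)


unemployed = [10.0, 12.0, 22.0, 15.0]  # invented percentages
employed = employed_from_unemployed(unemployed)

print(swing(unemployed), swing(employed))   # the same swing in both series
print(share_of_axis(unemployed, 0, 25))     # assumed axis for the first chart
print(share_of_axis(employed, 0, 100))      # assumed axis for the second chart
```

The swing in percentage points is identical in the two series; only the share of the axis it fills, and hence the impression it makes, depends on the scale chosen.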



Figures 3 and 4 are typical examples of the well-known
time charts that are used to depict changes in
quantities with time, and are so widely understood
that even daily papers with mass circulations use them.
From Figure 3 we see the dire effects of the 1931-32
trade depression and the smaller effects of the 1926
General Strike. Figure 4 shows the same effects but
puts them in a different perspective, and we see that
these profoundly important changes touched only the
fringe of our total industrial effort as measured by the
number of people registered as being employed.

The detailed examination of a graph, noting a rise
or a fall in level here and there, is a good thing to
undertake when there are particular events to which
the fluctuations can be related; but more often it is
desirable to notice the general character of the
fluctuations, to obtain a ‘bird’s-eye view’. There are
several patterns to which time fluctuations may
conform, although the combination of two or more
patterns may make the resulting form of the graph
very complicated. However, it is useful to everybody
to be able to recognize the various patterns, and where
there are several combined to be able to separate them;
so in Figures 5-10 I give a few illustrations and
describe the significant features of the changes they
represent.*

Figure 5 shows an exceptionally steady upward
trend in the deaths due to cancer, from about 50,000
in 1919 to nearly 80,000 in 1938. This is partly due
to the increasing proportion of older people in the 
population — older people are more liable to suffer 
from cancer — and is probably affected also by the 
increasing accuracy with which cancer is diagnosed. 
Whether these facts are sufficient to explain the 
increase in deaths I cannot say. 



* The data for Figures 5-9 are from the Statistical Abstract,
and those for Figure 10 from ‘Marriage Frequency and
Economic Fluctuations in England and Wales, 1851-1934’,
by D. V. Glass. This paper is in Political Arithmetic

From Figure 6 we see that the numbers of cases of
smallpox rose fairly steadily from a few hundreds per 
annum in 1919-21 to 10,000-15,000 in 1927-30 (many
readers will remember a smallpox scare in those years)
and fell practically to zero in 1935-38. There are also 
some random fluctuations of negligible importance. 



The fluctuations in deaths due to influenza shown 
in Figure 7 are violent, and at first sight seem to follow 
no pattern. On closer examination, however, we see 
a tendency for years with large numbers of deaths to 
alternate with those with small numbers. Such a
pattern, in which the form of the fluctuations repeats
itself at regular intervals (two years in this instance),
is described as periodic. The periodicity ‘misses step’
in 1921 and 1926, while in 1931 and 1935 the peak is 
not high, but the general tendency to a periodicity is 
undoubtedly there. We may sum up the situation by 
stating that the annual deaths from influenza fluctuated 
between 5,000 and 45,000, and that usually, but not 
always, a year with a large number of deaths was
followed by one with a small number, and vice
versa.

The fluctuations in deaths due to whooping-cough
(Figure 8) are also large, and are difficult to summarize.
I do not know of any factors to which I can relate
the variations, and can only describe the main features
of the graph as a downward trend from 5,000-6,000
deaths per annum in 1920 and 1921 to about 1,500 in
1938, with occasional ‘peak years’ in which the deaths
rose to 7,000-8,000. 

Figure 9 gives a fair idea of the amount of motor
traffic on the roads quarter by quarter, and of the
numbers of fatal accidents year by year. There is a
general upward trend in the numbers of motor
licences, the regularity of which is interrupted only
by the major effects of the depression years of 1931-32.
Superimposed on this trend, however, is a pronounced
seasonal pattern. Each year the number of licences
rises from a low value in the first quarter (28th 
February) to a high value in the second (31st May) 
and a still higher value in the third (31st August);
and in the fourth quarter (30th November) there is a
slight fall, which is succeeded by a much larger fall
in the first quarter of the next year. A seasonal 
pattern of this kind, which is a special example of a 
periodic fluctuation, often occurs in data that are 
given weekly, monthly, or quarterly, although it is 
not often so well marked as the pattern of Figure 9. 

The accidents given in Figure 9 are drawn on such 
a scale that, if the increases year by year had been in 
the same proportions as the numbers of licences, the
general rise in the curves would have been much the
same. Up to 1934 the numbers of accidents and
motor licences rose and fell fairly well together. After
1934, however, there was first a drop in the number of 
accidents, which then remained fairly steady at about 
6,400, whereas the number of licences resumed its 
large annual increase. This relative improvement in 
the accident figures after 1934 may reasonably be 
attributed to the conscious efforts, legislative and 
other, of the nation to reduce accidents; and the 
result, although not good enough, is encouraging. 

The last examples of time series are in Figure 10. 
The index numbers of ‘ real wages ’ plotted there take 
into account wage rates, the amount of unemployment, 
contributions to and from social insurance and un- 
employment funds, and changes in the level of prices. 
It is difficult to obtain accurate figures on these
subjects for as far back as 1850, and how far the index
measures the well-being of the workers is not clear,
but the results plotted in Figure 10 give a rough measure
of changes in the prosperity of the workers. The
fluctuations in the marriage rates are affected by
changes in the proportions of the population that were
of marriageable age, as well as in marriage habits.
However, it is not our main purpose to speculate on
the causes of the fluctuations shown in Figure 10 but
merely to note their form. 

First, if we pay attention only to the slow, long- 
period changes, we notice a rise in real wages from 
the first decade, 1850-60, culminating in a peak soon 
after 1900, and thereafter a slight fall to a fairly 
uniform level. Superimposed on this movement is a 
somewhat irregular wave-like movement, with peaks 
indicated roughly by arrows. These peaks do not
occur at uniform intervals as do those for the influenza
data in Figure 7, but the wave-like pattern is well
marked. Other economic data show somewhat similar 
fluctuations, which are described as alternating booms 
and depressions and constitute the well-known 
phenomenon of the business or trade cycle. Super- 
imposed on the waves are smaller random fluctuations 
which produce sporadic peaks and valleys of no great
importance.



The slow movement of the marriage rate is a downward
trend up to about 1880, and thereafter the
fluctuations are about a trend which rises very slightly
until about 1900. There is a period of wild fluctuation
about 1915-20, which was due to conditions
caused by the last Great War, and a marked rise in
1934-35 to a value which was maintained until 1938.
We see also a wave-like pattern up to the year 1910,
with peaks as marked roughly by the arrows, spaced
at irregular intervals. I shall later discuss the relation
between the two series in Figure 10.

Thus we see that in summarizing and presenting
his material the statistician classifies it if necessary,
and makes use of suitable tables and diagrams. The
picture he gives is impressionistic, and he must do
his work skilfully and honestly if he is to avoid creating
false impressions. The person who receives the
statistical information also has an active part to play.
For this he needs to know how to read tables and
diagrams, and to understand their meaning; and the
more he knows of the general principles on which
they are made, the more critically is he able to examine
them and the less likely is he to be led astray.


CHAPTER IV 

SOME SPECIAL TABLES AND DIAGRAMS 

OF IMPORTANCE 

Table 3 gives the weights of fifty apples from the 
same tree. It is typical of a large class of statistical 
data in that it refers to a number of things of the same 
kind, varying in some measurable character, and I 
propose to show how such figures are dealt with. 
Usually there are hundreds or even thousands of 
results in a single collection; there is room here to 
give only fifty but they will be enough to illustrate 
the methods. 

Table 3

Weights of Apples, grams

106  223  139   81  136  107  125  119  131  123
 76  111  115   75   90   82  109  107  115   93
187   95   92   86   70  126   68  105  130  128
100   84  129  113  204  111   84  115  104   98
110  110   80   78  118   82  166   90   99  107

As they stand, the figures in Table 3 are an almost
meaningless jumble, but we can reduce them to order
by applying the general methods of classification and
summarization mentioned in the last chapter. The
weights vary between 68 and 223 grams, and in view
of this wide range it is obviously unnecessary to
distinguish between apples differing by only a few
grams. Even two small boys, faced with such a
collection and seeing how different the apples can be,
might be satisfied that they were being treated nearly
enough alike, if one was given an apple weighing
80 grams and the other one weighing say 90 grams.
However, we will be content to regard as equivalent,
apples differing by up to 20 grams, make a few broad
classes covering the whole range of weights, and count
the apples in each class. The results are in Table 4.
The sub-ranges, as they are called, are in the first
column, and the numbers or frequencies of apples are
in the second column of the table. Table 4 is an
example of a frequency distribution, so called because
it shows how the frequencies of apples are distributed
between the various classes of weight. It is a summarized
form of Table 3, and in obtaining it we have
suppressed little or no detail of any importance, even
though the classes are very broad.

Table 4

Frequency Distribution of Weights of Apples

Weight (grams)      Frequency of Apples

  60- 79                  5
  80- 99                 14
 100-119                 18
 120-139                  9
 140-159                ...
 160-179                  1
 180-199                  1
 200-219                  1
 220-239                  1

 Total                   50
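
The counting that turns Table 3 into Table 4 can be sketched as follows. The weights are the fifty values of Table 3 as far as they can be read; a few entries were poorly legible in this copy and are taken here, as an assumption, to be the values that agree with the class totals of Table 4. The function name is hypothetical.

```python
# Weights of the fifty apples, in grams (a few entries are assumed readings).
APPLE_WEIGHTS = [
    106, 223, 139, 81, 136, 107, 125, 119, 131, 123,
    76, 111, 115, 75, 90, 82, 109, 107, 115, 93,
    187, 95, 92, 86, 70, 126, 68, 105, 130, 128,
    100, 84, 129, 113, 204, 111, 84, 115, 104, 98,
    110, 110, 80, 78, 118, 82, 166, 90, 99, 107,
]


def frequency_distribution(values, start, width, n_classes):
    """Count how many values fall in each sub-range of equal width."""
    counts = [0] * n_classes
    for value in values:
        index = (value - start) // width  # 60-79 -> 0, 80-99 -> 1, ...
        counts[index] += 1
    return counts


apple_counts = frequency_distribution(APPLE_WEIGHTS, start=60, width=20, n_classes=9)
for i, count in enumerate(apple_counts):
    low = 60 + 20 * i
    print(f"{low:3d}-{low + 19:3d}  {count}")
```

Fifty separate weights are replaced by nine counts, and the shape of the distribution becomes visible at once.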


The two essential elements behind a frequency
distribution are the things that are counted, called
the individuals, and the quantity or quality that is
measured and defines the classes, called the character.
An individual, in the statistical sense, may be a person
or a thing; it may be a concrete thing like an apple,
or something more abstract like a vote or an experimental
observation; and it may be something we
ordinarily recognize as a single entity like a man, or
complex entity like a family or a business concern.
The character of the apples is called quantitative
because it is described by a numerical measurement,
but qualitative characters, which are described in
words, are also met with. For example, Table 2
is a frequency distribution in which the
individuals are road accidents and the
character is the class of road-user causing the accident.
For quantitative characters the methods of [...]
are standardized, so
that a standard interpretation is possible.

In order to discuss what a frequency distribution
really means I shall use an example based on more
adequate numbers than fifty. The life of an electric
lamp is the number of hours it burns at a standard
voltage, and in Table 5 the results for 150 lamps have
been grouped into classes with sub-ranges of 200
hours. For the sake of more vivid representation,
the distribution is given as a frequency diagram in
Figure 11, where the lamps in each class are piled
in a column proportional in height to the number in
the class. These columns are usually represented
by plain rectangles, or alternatively the tops of the
columns may be joined by sloping lines; I have used
the pictorial form here in order to help readers to
understand what a frequency diagram is.

Table 5

Length of Life of Electric Lamps

(Data by E. S. Pearson, Journal of the Royal Statistical
Society, 96, 1933)

Life (hours)        Frequency of Lamps

    0-  200               [?]
  200-  400               [?]
  400-  600               [?]
  600-  800               [?]
  800-1,000               [?]
1,000-1,200               [?]
1,200-1,400               [?]
1,400-1,600               [?]
1,600-1,800               [?]
1,800-2,000               [?]
2,000-2,200               [?]
2,200-2,400               [?]
2,400-2,600               [?]
2,600-2,800               [?]
2,800-3,000               [?]
3,000-3,200               [?]
3,200-3,400               [?]

Total                     150



In considering a frequency diagram, small irregu- 
larities in outline such as the minor peak for the 200- 
400 hour group in Figure 11 are ignored, and notice 
is taken only of the general shape, which is sometimes 
shown by a smooth curve drawn through the points
of an actual diagram. A diagram of a given general
shape means the same thing to statisticians all the
world over.

We see that the diagram in Figure 11 is spread from
about 0 to 3,400 hours, showing the extent of the
variation in life; that there is a peak, showing a
tendency for the lamps to be concentrated about a
typical value between 1,200 and 1,400 hours; and
that there is a reduction in height towards the sides,
showing the comparative rarity of lamps approaching
the extremes of life.

In order to see one way in which a frequency
distribution can arise, let us imagine a shooting target
in the form of vertical strips instead of the familiar
concentric rings, the centre strip being the bull,
and let us consider a marksman shooting at this many
times from a rifle. The shots will be peppered over
the target. There will be more shots in the centre
strip than in any other, and as we move from the
centre towards the edges each strip in turn will have
fewer and fewer shots, until the extreme strips will
have very few indeed, or none. If we count the shots
in each strip, and draw a frequency diagram, the
shape will be like that just described for the lamps,
with a peak at the centre and tailing off towards the
edges. I do not suggest that all frequency distributions
arise in this kind of way, but this illustration
often helps readers to see what a frequency diagram
means.

There are two main things to notice about a diagram
like Figure 11, after its general shape; these are the
position of its peak and its width. The peak for
Figure 11 is in the 1,200-1,400 hour group; if we
had another batch of lamps with a peak in the
1,800-2,000 hour group, we should probably prefer
those lamps as having a longer typical life. The
width of the distribution measures the degree of
variation about the typical value. For example, an
imagined batch of lamps with a typical life of
1,800-2,000 hours might have a distribution ranging
from 0 to 4,500 hours, and this batch would be much
more variable in life than the original batch. If we had
two marksmen shooting at similar targets of the kind 
just described, a good marksman and a poor one, the 
frequency diagram for the good one would be narrow 
with a tall, sharp peak showing little variability in the 
placing of the shots; that for the poor marksman 
would be broad and squat, with more of a knoll than
a peak, and showing much variability. This
characteristic of variability is a highly important one.

The more or less symmetrical bell-like shape of
Figure 11 is the most common in frequency diagrams,
but other shapes do exist, which describe other types
of variation. For example, the distribution among
the people of Great Britain of wealth in most of its
forms is such that the vast majority are relatively poor
and a very few people are very wealthy; and this
distribution is represented by an extremely lop-sided
diagram of the kind shown in Figure 12, which shows
the distribution of sur-tax payers according to income.
The uneven distribution of these incomes is even
more pronounced than appears

and (b) the numbers living in each age group and
exposed to the risk of death; and it is because these 
numbers decrease for the highest age groups that the 
numbers of deaths in those groups decrease. 

A list of the characters of several hundreds of 
individuals seems to be an overwhelming and chaotic
mass of information, particularly if the items are in 
some irregular order in which they happen to have 
been recorded. A frequency distribution is an 
economical summary of such figures, since it replaces 
the several hundred readings by a table with some 
ten or twenty entries, and it reduces the chaos to an 
order the mind can comprehend. Indeed, the things 
we need to know about such a body of observations
are not really many or complex. We need to know
whether there is a well-defined typical value of the 
character, and if so what the value is; to what extent 
the individuals vary about that type; and whether 
the variation extends equally above and below the 
type. These facts are readily seen from and described 
by a frequency table or diagram. A roughly drawn
frequency curve is often used to give a vivid if
approximate description of the nature of variation in a
distribution, and with its aid a statistician can sum up
a statistical situation, somewhat as a clever cartoonist
can, with a few strokes of his pen, convey a whole
complex of moods or ideas. In Figure 14 are drawn
on the same scale a number of pairs of imaginary
frequency curves of some unspecified character, to
show the kind of tale they can tell. 
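
The reduction of a long list of readings to a frequency table can be sketched in a few lines of Python. This is only an illustration: the function name, the class width of 200 hours and the ten lamp lives are assumptions made for the example, and are not the data of Table 5.

```python
from collections import Counter


def frequency_table(readings, width, start=0):
    """Count the readings falling in each class [lower, lower + width)."""
    counts = Counter(int((x - start) // width) for x in readings)
    table = []
    # Include empty classes between the lowest and highest occupied ones.
    for k in range(min(counts), max(counts) + 1):
        lower = start + k * width
        table.append((lower, lower + width, counts.get(k, 0)))
    return table


# Hypothetical lamp lives in hours, invented for illustration.
lives = [950, 1210, 1340, 1280, 1560, 1390, 1720, 1105, 1250, 2010]
table = frequency_table(lives, width=200)

for lower, upper, count in table:
    print(f"{lower:>5}-{upper:<5} {count}")
```

Ten readings are replaced here by seven classes; with several hundred readings the saving is naturally much greater, and the peak and spread can be read off at once.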

If each individual has two characters that have been
observed, we may classify the observations according
to the two characters, and form a two-way frequency
table. If the character is qualitative this is known
as a contingency table: Table 6 is an example. The
individuals are the persons killed in the accidents
referred to in Table 2 (p. 25), and the two characters
are the class of person killed and the class of person
to whom the accident is attributed.

Figure 14

Imaginary Frequency Distributions.
We see that 1,064 drivers were killed in accidents
caused by drivers, 261 pedal cyclists in accidents
caused by drivers, and so on. The entries in the row
and column labelled ‘Total’ form two frequency

Table 6

Numbers of Persons killed in Fatal Road Accidents in Great
Britain and Persons to whom the Accidents are attributed,
1936-37

                              Accidents attributed to
Persons killed         Drivers    Pedal   Pedes-    Other   Total
                               Cyclists   trians  Persons
Drivers other than
  pedal cyclists         1,064       13       17        3   1,097
Pedal cyclists             261      982       25        1   1,269
Pedestrians                438       51    2,440        1   2,930
Other persons              517       18        2      128     665
Total                    2,280    1,064    2,484      133   5,961

distributions, and the rows and columns in the body 
of the table subdivide these two distributions, adding 
to our information. 

Table 6 shows clearly two characteristic features of 
contingency tables. First we notice that the
frequencies are not uniformly distributed, most being
contained in the cells along the diagonal of the table 
starting at the top left-hand corner, i.e. in the cells 
containing the entries 1,064, 982, 2,440 and 128. 
This shows that most of the people were killed in
accidents caused by road-users of the same class as
themselves, i.e. there is a strong tendency for the class 
of person killed and the class of person causing the 
accident to go together. The second thing we notice 
is that not all the frequencies are in the diagonal cells: 
that some of the people who suffered death are not of 
the same class as those who caused the accident : that 
there are exceptions to the tendency for the two char- 
acters to go together. We may sum up the situation 
by saying that rough retributive justice seems to have 
been done between the classes of road-users; justice 
because the victim tends to be of the same class as the 
person who caused the accident, and rough justice 
because he is not always of the same class. By the
way, before making any attempt to explain the
foregoing results, readers should consider two things:
(1) the person causing the accident is not necessarily
the same individual as the victim even if they are both 
of the same class, and (2) if the victim of the accident is 
blamed for causing it he is not there to defend himself. 

The general statistical features of Table 6 which I 
have pointed out are described in statistical language 
by the word association, and we say that the two 
characters are associated. The word association con- 
notes both the tendency for a connexion between the 
two characters to show itself, and the deviations from 
that tendency. In Table 6 the tendency is pronounced 
and the deviations are not very important, and we say
that the association is strong. 

What would Table 6 look like if there was no 
association? The answer is: Like Table 7. 

The distributions in the ‘Total’ column and row are 
the same for Table 7 as for Table 6; they have nothing 
to do with association. In Table 7, however, the
frequencies in each column bear the same relations to
each other as those in the ‘Total’ column, and similarly

Table 7

Numbers of Persons killed in Fatal Road Accidents in Great
Britain and Persons to whom the Accidents are attributed,
1936-37. No association

                              Accidents attributed to
Persons killed         Drivers    Pedal   Pedes-    Other   Total
                               Cyclists   trians  Persons
Drivers other than
  pedal cyclists           420      196      457       24   1,097
Pedal cyclists             486      226      529       28   1,269
Pedestrians              1,120      523    1,221       66   2,930
Other persons              254      119      277       15     665
Total                    2,280    1,064    2,484      133   5,961

for the rows. For example, 420 is to 486 is to 1,120
is to 254 as 1,097 is to 1,269 is to 2,930 is to 665.
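
The ‘no association’ frequencies can be worked out from the two ‘Total’ distributions alone. The following Python sketch assumes the usual rule, namely row total times column total divided by the grand total, which is consistent with the proportionality just described; the function name is illustrative. The rounded results agree with Table 7 to within a unit or so, since simple rounding need not make every row and column add exactly.

```python
def expected_no_association(row_totals, col_totals):
    """Cell frequencies that keep both margins but show no association."""
    grand = sum(row_totals)
    assert grand == sum(col_totals), "the two margins must add to the same total"
    return [[r * c / grand for c in col_totals] for r in row_totals]


# Margins of Table 6.
row_totals = [1097, 1269, 2930, 665]   # persons killed: 'Total' column
col_totals = [2280, 1064, 2484, 133]   # accidents attributed to: 'Total' row

expected = expected_no_association(row_totals, col_totals)
for row in expected:
    print([round(v) for v in row])
```

Every column of the result is proportional to the ‘Total’ column, and every row to the ‘Total’ row, which is exactly the property that defines the absence of association.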

The degree of association may vary from extreme
strength, when only one cell in each row and column
has a frequency, through the stage shown by Table 6,
to the zero association of Table 7. I enlarge below on
this idea of strength of association.

In Table 6 the association shows itself by most 
individuals being in the cells along one diagonal in the 
table. That is not an essential feature and arises from
the particular arrangement of the rows and
columns. If the columns, say, were rearranged, the
diagonal pattern would be lost but the association
would still be there.

Table 8

[Two-way frequency table: Age of Husband by Age of Wife]


When the two characters are quantitative, the
two-way table has the form of Table 8. The
individuals of Table 8 are married couples living
together, and the two characters are the ages of
husband and wife. There is a pronounced association,
which is shown by the tendency for the frequencies to
occur in cells about the diagonal, giving the whole
table a characteristic appearance.
Association between two quantitative characters is
called correlation, and we say that the two characters
are correlated. The strong correlation between age of
husband and wife is in accordance with our general
experience, for although men sometimes have wives
much younger than themselves and, less often, wives
much older than themselves, age in the husband is
usually associated with age in the wife.

Table 9

Numbers of Husbands and Wives who died at Various Ages.
Data from Gravestones in the Yorkshire Dales
(Biometrika, z, 1907, p. 481)

If, however, we consider the ages at which husbands
and wives die, the correlation largely disappears.
Whether the wife dies when young or old makes little
difference to the age at which the husband dies. This
lack of correlation is shown by the appearance of
Table 9, which refers to married couples whose ages
at death were recorded on the gravestones of country
churchyards in the Yorkshire Dales. Table 9 looks
quite different from Table 8 in that the frequencies do
not tend to be concentrated about a diagonal, although
statistical analysis shows that there is a slight tendency
for this to occur and that there is consequently a very
slight correlation.

When the number of individuals is small, correlations
may be shown to the eye by a correlation diagram, as in
Figure 15.

and the scatter of the points is less. The concentration
of the points in Figure 15A gives it, in essentials, an
appearance not unlike that of Table 6.

This conception of correlation was introduced by 
Galton late in the nineteenth century, and it is now 
one of the most important and useful ideas we have. 
Correlation expresses the general idea of a relationship 
between two quantitative characters and also of 
departure from that relationship — of a relationship 
which is apparent and well defined when there is a 
number of observations, but which describes only 
approximately the connexion between the two char- 
acters for any one individual. 

When the two characters tend to increase together, 
the correlation is said to be positive, and when one 
tends to increase as the other decreases, the correlation 
is negative. In a table or diagram a positive correla- 
tion gives the same appearance of clustering as a 
negative one, but the figures or points tend to follow 
the other diagonal. The correlations in Table 8 and
Figures 15A and B are all positive, and it is perhaps
unfortunate that the rules for arranging tables and 
diagrams should be such that the table shows a ‘down- 
hill’ trend, moving from left to right, and the diagram 
an ‘up-hill’ trend for positive correlation. 

We may interpret correlation by imagining that we 
have taken a man at random, and that we wish to 
guess or ‘predict’ his age. If we know nothing about
him except that he is one of those recorded in Table 8
we can only say that he is somewhere between 15 and
about 95, and that he is most likely to be near the
typical age-group of 25-35. If, however, we are told
that his wife is between 25 and 35, we see from Table 8
that he will be between 15 and 75. The effect of the
correlation is to reduce the range of uncertainty of our
prediction. In Table 9, on the other hand, the range 
of uncertainty of prediction is from 25 to something 
over 95, if we know only that the man’s age is recorded 
on a gravestone in the Yorkshire Dales; and if we also 
know that his wife died between 65 and 75 years of 
age, say, the range of uncertainty is only very slightly 
reduced. The stronger the correlation, the greater is 
the accuracy with which a knowledge of one character 
enables us to predict the value of the other, as compared 
with the accuracy of the random guess. 
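
The narrowing of the range of uncertainty can be illustrated with a small Python sketch. The two-way table below is entirely hypothetical (it is not Table 8), with age groups labelled by their lower limits; the function name is likewise an assumption made for the example.

```python
# Hypothetical two-way table: counts[(husband_group, wife_group)].
# Age groups are labelled by their lower limits; invented for illustration.
counts = {
    (15, 15): 30, (25, 15): 60, (35, 15): 5,
    (15, 25): 10, (25, 25): 300, (35, 25): 120, (45, 25): 15,
    (35, 35): 250, (45, 35): 110, (55, 35): 10,
    (45, 45): 200, (55, 45): 90, (65, 45): 8,
}


def husband_range(counts, wife_group=None):
    """Lowest and highest husband's age group with any couples in it,
    optionally restricted to couples whose wife is in wife_group."""
    groups = [h for (h, w), n in counts.items()
              if n > 0 and (wife_group is None or w == wife_group)]
    return min(groups), max(groups)


print(husband_range(counts))       # knowing nothing about the wife
print(husband_range(counts, 25))   # knowing the wife is in the 25 group
```

In this invented table the husband may fall in any group from 15 to 65 when nothing is known of the wife, but only from 15 to 45 when she is known to be in the 25 group. With no correlation the two ranges would be much the same.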

Figures 15A and B suggest another interpretation
of correlation which is only sometimes valid. It is 
common sense that the crop will tend to increase with 
the area cultivated and with the yield per acre ; these 
two factors may be regarded as ‘causes’ of the crop 
variation, and the correlation a visible demonstration 
of their operation. This leads us to an interpretation 
of the scatter of the points. Had the cultivated area 
been the only causal factor that varied between 1920 
and 1938 all the points in Figure 15A would have been 
accurately on a line sloping diagonally. The scatter is 
due to the operation of some additional cause or causes, 
which our common sense tells us can, in this instance, 
only be those affecting the yield per acre. Similarly, 
the trend of points in Figure 15B is due to the causes 
associated with the yield per acre, and the scatter to 
the disturbing causes associated with the area cultivated. 
If the correlation is high, as in Figure 15A, the cause 
accounted for is relatively important and the disturbing 
causes are relatively unimportant. A lower degree of 
correlation as shown in Figure 15B may be interpreted 
to mean that the causes of which account is taken 
(those associated with yield per acre) are less important
relative to the unaccounted causes. The results of
Figures 15A and B are incidentally of interest since the 
causes affecting the changes in cultivated area are 
mostly economic and political, those affecting the yield 
per acre are meteorological and technical. In the 
years 1920-38, although both sets of causes produced
variations in the total crop of wheat, variations due to 
economic and political causes were more important 
than those due to technical causes and the weather. 

Thus we see that when any cause affects the
variations in a character, the effect shows itself as a
correlation. But the existence of a correlation does 
not prove the existence of a causal relationship. Two 
characters can be correlated because they are both 
affected by a third group of causes, and sometimes they 
may simply happen to be correlated. Finally, I 
emphasize the fact that a correlation, like all statistical 
results, merely describes the relations within a given 
set of data, referring to a particular set of conditions 
and taken at a particular time. It may or may not be 
possible to generalize from such results. 

Correlation may show itself in most complicated 
ways when the two characters are in a time series. 
Then, the best representation is in a time chart like 
Figure 10 (p. 37), and it is necessary to consider 
separately the different features of the time variations. 
For example, for the years 1850 to about 1890 there 
is a general upward trend in real wages and a downward 
trend in the marriage rate, and as far as this trend is 
concerned there is a negative correlation, i.e. an increase
in wages goes with a decrease in marriage rate. For 
the wave-like variations, on the other hand, the peaks 
of real wages tend more or less to coincide with those 
of marriage rate and these shorter-term movements
show a positive correlation, i.e. wages and the marriage
rate tend to rise and fall together in the short-term
movement. I do not wish to rush in where angels
fear to tread and try to explain these results, but they
will mean a little more to readers if I suggest tentatively
that, as far as the short-term fluctuations are concerned,
the wave-like changes in prosperity as measured by
the wage index have obvious causal effects on the
marriage rate, and that the slow trends in the two
series have little connexion with each other, one being
largely due to changes in social habits and in the age
distribution of the population, and the other largely
to changes in industrial efficiency. A correlation
between time series that arises from two similar trends
is seldom due to a causal relationship; one that
results from similar cyclical movements is sometimes 
due to a causal relationship; and one that results from 
correlated random movements is often due to a causal 
relationship. There are, of course, methods of 
statistical analysis that enable the various kinds of 
fluctuations and correlations to be separated out and 
measured more exactly than we can do by a cursory 
examination of the diagrams. These form a very 
large subject known as ‘The Analysis of Time Series’.


Frequency distributions of single characters give rise 
to the concept of variation about a type, and tables and 
diagrams relating pairs of characters to the concepts
of association and correlation. These concepts are
essential to the statistician but they are also useful to
the ordinary citizen, since they help in making sense
of figures that come within everyday experience.



CHAPTER V 

‘EXPRESSING IT IN NUMBERS’

On a wall of the Biometric Laboratory at University
College, London, where much of the present science 
of statistics has been developed, is written the following 
motto : 

‘ When you can measure what you are speaking about 
and express it in numbers, you know something about 
it, but when you cannot measure it, when you cannot 
express it in numbers, your knowledge is of a meagre
and unsatisfactory kind.’ (Lord Kelvin.)

This motto well expresses the spirit that has inspired 
statisticians, and much of the work of the pioneers of 
the subject has been towards developing ways of 
expressing in numbers measures of statistical concepts 
which I have so far described only in general terms.
I think the motto is an overstatement, since good has 
come of many qualitative studies — in biology for 
example — and it omits to state the requirement that 
the numerical measure should be of a kind that can 
be brought into a system and related to other quantities, 
or it is sterile and little better than qualitative
knowledge. All the same, the development of numerical
measures is a very important step in any science, and 
in this chapter I describe some of the chief measures 
in statistics. The use of these measures carries still 
further the process of summarizing data, bringing into 
prominence and describing the few features that are of 
most significance. 

The simplest statistical quantities are rates, ratios
and percentages. It is not necessary for me to define
these here, for we are taught in the arithmetic lesson
at school how to calculate them, but I point out one
fundamental feature they have in common: they all
express the value of one quantity relative to another. 
A death rate expresses the number of deaths in a 
locality during a year relative to the number of people 
living in that locality; and the percentages of un- 
employed workers are relative to the numbers of 
insured workers. 

One purpose in using such a method of expression
is to help people to grasp the meaning of the figures;
to bring them home to the imagination. In common 
with most people I have very little occasion to consider 
the populations of towns and countries, and so when
I read that the population of Shanghai is one and a 
quarter millions, my imagination is unstirred. Some 
people try to present facts of this kind by some such 
device as suggesting that 1,250,000 people stood 
shoulder to shoulder would reach nearly from London 
to Edinburgh. This does not help me in the least.
But I know Liverpool and that it has a population of 
about 800,000; when I realize that the population of 
Shanghai is about one and a half times this figure I 
begin to have some conception of what that is. 

One of the jobs of the statistician is to find suitable 
standards of reference. Note however that the 
standards should be suitable. I remember, a few 
years ago, hearing the Chancellor of the Exchequer 
expounding his budget over the wireless and em- 
phasizing its enormous total — some eight hundred 
million pounds. To those of us who only handle a 
few hundred pounds of money in a year such figures 
mean only a tremendous amount of money. Had 
the Chancellor expressed the total as amounting to a
rate of about £20 per head of the population, our
comprehension would have been a little better,
although the impression gained by a man with a 
family income of £100 per annum would have been 
different from that gained by a £20,000 a year man. 

I think the Chancellor would have done best to have 
expressed the amount of the budget as a ratio of the 
national income at that time— roughly one-fifth. 

A second reason for expressing a quantity as a ratio 
or percentage of another is that the ratio may contain 
all the information that matters, actual values of the 
two quantities being irrelevant details. For example,
if we wished to compare the risk of death in two
localities, it would be misleading to compare the
numbers of deaths, as the populations in the localities
might be different, and the sizes of the populations are
irrelevant. All the information we would need is 
contained in the death rates. 
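
A short Python sketch shows why the numbers of deaths alone can mislead. The two localities and all their figures are invented for the illustration, and the rate is taken, as an assumption, to be deaths per 1,000 of population in a year.

```python
def death_rate_per_1000(deaths, population):
    """Deaths in a year per 1,000 persons living in the locality."""
    return 1000 * deaths / population


# Hypothetical localities; the figures are invented for illustration.
town = {"deaths": 1200, "population": 100_000}
village = {"deaths": 30, "population": 2_000}

town_rate = death_rate_per_1000(**town)
village_rate = death_rate_per_1000(**village)

print(f"town:    {town['deaths']:>5} deaths, rate {town_rate:.1f} per 1,000")
print(f"village: {village['deaths']:>5} deaths, rate {village_rate:.1f} per 1,000")
```

The town has forty times as many deaths as the village, yet the risk of death, as measured by the rate, is the lower of the two.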

Percentages are much used when it is desired to 
study the relative changes in some quantity with 
time without considering the absolute amounts of 
the quantity or the changes. Then the value of the
quantity at some given time or base is taken as a
standard of reference, and the values at other times
are expressed as percentages of this. Such percentages
are index numbers. In studying changes in real wages
in Figure 10 (p. 37), for example, we are not
interested in the absolute values of the wages, and so
we express them as percentages of the wages received
in a base year, 1900 in this instance. The Ministry
of Labour cost-of-living index number is much used as 
a basis for making wage changes in certain industries. 
It is calculated from the cost to buy a standard list of 
goods that enter into the budget of an average
working-class family, and the excess of this over the cost in
July 1914 is expressed as a percentage of the cost in 
July 1914. Index numbers are very useful for com- 
paring changes in quantities that differ in kind or 
magnitude, e.g. for comparing the relative changes 
in unemployment and exports. It could profitably 
have been used in Figure 10 had I expressed all the 
marriage rates as percentages of the rate for 1900. 
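
Both forms of index number described above can be sketched in Python. The wage figures and the costs used here are hypothetical, and the function names are illustrative only; the first function expresses each value as a percentage of the base value, the second expresses the excess over the base as a percentage of the base, in the manner described for the cost-of-living index.

```python
def index_numbers(values, base_key):
    """Express every value as a percentage of the value at the base."""
    base = values[base_key]
    return {key: 100 * value / base for key, value in values.items()}


def percent_excess_over_base(cost, base_cost):
    """Excess of a cost over the base cost, as a percentage of the base."""
    return 100 * (cost - base_cost) / base_cost


# Hypothetical wages in three years, invented for illustration.
wages = {1890: 40.0, 1900: 50.0, 1910: 55.0}
idx = index_numbers(wages, base_key=1900)
print(idx)

# A hypothetical budget costing 155 against 100 in the base month.
print(percent_excess_over_base(155.0, 100.0))
```

Once two series of different kinds are each reduced to index numbers on the same base year, their relative movements can be compared directly.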

Some rates or ratios devised for special purposes, 
particularly those used in population and vital 
statistics, are very complicated. For example, in 
studying population trends, a ‘net reproduction rate’
is calculated from (a) the numbers of girls born in a 
given period, (b) the numbers and ages of the mothers, 
and (c) the proportions of the girls that will live to 
the various ages when they may themselves become 
mothers. This ratio is so calculated that if it is 1.0
the population is just maintaining its supply of pro- 
ducers of children — i.e. potential mothers. For 
England and Wales in 1934-36, the net reproduction
rate was 0.76, so that there were born only
three-quarters of the girls necessary to maintain the
population, assuming that fertility and death rates continue
unchanged. 

Usually, the investigator has little difficulty in 
devising a reasonably suitable rate, ratio or percentage 
for his purpose. Thus, the most important cause of 
the great increase in fatal road accidents between 
1922 and 1938 shown in Figure 9 (p. 35) is very
probably the increase in the number of motors on
the road — in the number of lethal instruments abroad. 
Indeed, one would expect that if everything else 
remained constant, accidents would increase
proportionately with the number of motors, so that
variations in the ratio of the number of accidents to
number of motors would indicate the effects of varia- 
tions in the other factors. However, we do not know 
the number of motors on the road, but the average 
number of licences current each year provides a good, 
if rough, measure, provided we may assume that the 
annual mileage of the average car did not change 

much with time. Thus, a rate that may reasonably
be regarded as indicating the effect of factors other
than the number of motors on the road is the number
of fatal accidents per 1,000 current motor licences.

Table 10

Number of Fatal Road Accidents per 1,000 Current Motor
Licences in Great Britain

This rate, calculated for the data of Figure 9, is given
in Table 10, from which it may be seen that the other
factors only became important in effecting the
improvement after 1934. Factors that could have
influenced the accident rate, but did not change
enough to do so appreciably at least up to 1934,
include the character of the motors, the numbers of
pedestrians and other road-users exposed to the risk
of accidents, the skill and care of drivers, and the
state of the roads. 

Sometimes, however, it passes the wit of man to 
devise a suitable rate or ratio. In 193^ there were 
253 fatal road accidents in Lancashire and 88 in 
Derbyshire. Does this mean that Lancashire drivers 
are worse than those in Derbyshire? Not necessarily. 
The two counties differ in population, in numbers of 
motors on the roads, and in the length and character 
of the roads (i.e. in ratio of rural to urban mileage); 
and I do not know how to devise a rate that will 
properly take account of these factors and measure 
the relative standards of driving in the two counties. 

A ratio or a percentage is not always the best means 
of comparing two quantities; a simple difference is 
sometimes better. A difference in aeroplane speed 
of say 25 miles per hour is 12½ per cent of 200 m.p.h.
and only about 8 per cent of 300 m.p.h., but such a 
difference is equally important to a bomber trying to 
outstrip a fighter at both speeds. 
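
The arithmetic of the aeroplane example can be checked with a few lines of Python; the function name is an assumption made for the illustration.

```python
def percent_of(difference, base):
    """A difference expressed as a percentage of a base value."""
    return 100 * difference / base


margin = 25  # miles per hour, the same at both speeds
at_200 = percent_of(margin, 200)
at_300 = percent_of(margin, 300)

print(f"{margin} m.p.h. is {at_200:.1f} per cent of 200 m.p.h.")
print(f"{margin} m.p.h. is {at_300:.1f} per cent of 300 m.p.h.")
```

The percentage falls as the base speed rises although the margin itself is unchanged, which is why the simple difference is the better measure here.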

Rates, ratios and percentages are rather ‘tricky’
quantities to deal with, and the unwary sometimes go
astray in using them. Most of the errors are due to
a neglect of the fact that these quantities are made up
of a numerator and a denominator. For example, the
percentage of all insured workers that are unemployed
is:

    number of insured workers unemployed
    ------------------------------------  x 100
      total number of insured workers

We call this, shortly, the ‘percentage unemployment’
and so are apt to overlook the denominator, which
does not appear in the short title of the quantity. I
give a few examples to illustrate this point.

The error of thinking that a large percentage change
in a quantity necessarily means a large actual change
is less commonly made than it was, and when the
Minister for the Production of This or That bids us
rejoice because the output of this or that has increased
by five hundred per cent, even those of us who are
not statisticians sceptically ask ‘Five hundred per cent
of how much?’

Changes in a percentage may be due to changes in 
the numerator, in the denominator, or in both; or the 
changes in the numerator and denominator may 
compensate for each other to keep the percentage 
unchanged. Thus, the percentage of the insured 
workers in employment in 1927 was about the same 
as in 1937, viz. about 90 per cent; but the estimated
numbers of insured workers in employment increased
from about nine and three-quarter millions in 1927 to
rather more than eleven millions in 1937.

The following example is given by Dr. A. Bradford
Hill. The data of Table 11 were used by someone to
show the bad effect on the infantile death rate in rural
districts of the lack of facilities for dealing with
confinements. To this lack was attributed the higher
percentage deaths under one month in the rural areas
given in the last column of Table 11. In fact, as
Dr. Hill points out, the death rate under one month
of age is lower in rural than in urban areas, and the
relatively unfavourable percentage for the rural areas
is due to the relatively favourable death rate for all ages
under one year, i.e. to the reduction in the denominator
of the percentage. A lack of facilities in rural areas
may account for some infantile deaths, but the figures
of Table 11 do not show such an effect.
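
The part played by the denominator can be seen by recomputing the percentages. In the Python sketch below the urban rates and the rural rate under one year are those of Table 11; the rural rate under one month is an assumed figure of 23.5, chosen only so as to agree with the 40 per cent quoted, and should not be read as the recorded value.

```python
def percent(numerator, denominator):
    """Numerator as a percentage of denominator."""
    return 100 * numerator / denominator


# Deaths per 1,000 live births.
urban_month, urban_year = 29.67, 95.37   # from Table 11
rural_year = 58.66                       # from Table 11
rural_month = 23.5                       # ASSUMED, see the note above

urban_pct = percent(urban_month, urban_year)
rural_pct = percent(rural_month, rural_year)

print(f"urban: {urban_pct:.0f} per cent of deaths under one year")
print(f"rural: {rural_pct:.0f} per cent of deaths under one year")
```

On these figures the rural numerator is the smaller of the two, yet the rural percentage is the larger, simply because the rural denominator is so much smaller.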

Before leaving the subject of percentages, I wish to
protest against the grandiloquent misuse of the term
percentage when a simple ratio would be better.

When a ratio is an awkward fraction, it is convenient
and legitimate to multiply it by one hundred and

Table 11

Deaths per 1,000 Live Births in Urban and Rural Areas,
at Various Ages

          Deaths under   Deaths under   Deaths under One Month
          One Month      One Year       as Percentage of Deaths
                                        under One Year
Urban         29.67          95.37               31
Rural                        58.66               40

convert it to a percentage; ‘seven per cent’ is a better 
phrase than ‘seven hundredths.’ When, however, 
the ratio is a multiple of unity, it is pretentious to 
express it as a percentage; it sounds very grand to 
talk of an increase of five hundred per cent, but it is 
better English to talk of an increase to six times the 
original value. Worst of all is a mixture of methods 
of expression. The mixture is rich in the following 
extract from a letter to a newspaper, even if we allow 
that the lack of intelligibility of the last sentence is due
to the omission of a £ sign before the last figure:

‘As far as British cycles are concerned, the best 
illustration is that of comparison with Germany, 
whose exports have fallen in the last few years to 
under 200,000, whereas during the same period British
exports have increased to approaching 400,000, and in
the recent trade depression while Germany has lost
nearly 60 per cent of her exports Great Britain has
lost but 36 per cent. ... So, too, the importance of 
reciprocal arrangements in the Dominions may best 
be emphasised by the fact that, whereas in 1929 our 
exports to the self-governing Dominions and India 
amounted to £1,461,073, in 1931 the total is estimated 
not to exceed 400,000.’ 

When we have a quantity that varies from place to 
place or from time to time, and we wish to obtain an 
idea of what the Concise Oxford Dictionary calls ‘the 
generally prevailing degree or amount’, we calculate
an average. The form in which it is usually calculated
is known precisely as the arithmetic mean, although it
is more often referred to in ordinary language as the
average. It is the sum of the individual values divided
by the number of individuals. There are other
averages, but they need not concern us here. 

The notion of an average carries with it, by implica- 
tion, the notion of variation; for we do not average an 
invariable quantity: we do not ordinarily talk of the 
average length of a day. When we calculate an 
average, however, we choose to ignore the variation 
and focus attention on the ‘generally prevailing’ value. 
This means a very big step in the process of statistical 
summarizing, substituting for the several individual 
values the one. Sometimes, however, people forget the 
variation they have ignored, and are misled by taking 
account of the average alone ; and it is because of the 
prevalence of this error that statisticians are at great 
pains to stress the inadequacy of this constant. The 
average has its limitations, but provided they are 
recognized, there is no single statistical quantity more 
valuable than the average. When we read that the
nation spent an average of 9s. per head per week on
food in 1934 and that the average income was about 
30s. per head per week, we have conveyed two pieces 
of information that are striking and useful, even though 
they tell us nothing of the large variations from one 
person to another in the expenditure on food and 
income. The average life of the 150 electric lamps 
mentioned in Table 5 (p. 41) is 1,452 hours, and if we 
use such lamps in the home, where we can let each one 
burn out before replacing it, that average means
something. Such lamps at 2s. each are as valuable (as
regards life) as lamps at 1s. each that have an average
life of 726 hours; and this statement is true irrespective
of the variation from lamp to lamp.

When a frequency distribution is, like that of the 
electric lamps, more or less symmetrical with a peak 
towards the centre of the range of variation, the 
average is an important descriptive constant, for it is 
near the typical value, and the variation is more or
less the same above and below it.

Many quantities are in fact averages, although they 
do not always appear to be so. Thus, in England 
and Wales in 1938, there were 478,829 deaths in a 
population of 41,215,000, giving a death rate of 11.6
per 1,000. But the population is made up of people 
of all ages, following all sorts of occupations and living 
in many localities, and for every sub-division of the 
population there is a separate death rate. The crude 
death rate is an average of all these. 
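
The arithmetic behind the crude death rate can be checked in a few lines. The following is a minimal sketch in Python, not part of the original text; the variable names are illustrative, and only the two totals quoted above are used.

```python
# Figures quoted in the text for England and Wales, 1938.
deaths = 478_829
population = 41_215_000

# Crude death rate: deaths per 1,000 of population.
crude_death_rate = deaths / population * 1000

print(f"{crude_death_rate:.1f} per 1,000")
```

Rounded to one decimal place the result agrees with the rate of 11.6 per 1,000 given in the text.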

Averaging is very useful in making index numbers. 
I have already described (p. 60) how, in order to 
measure changes in a quantity from time to time, the 
successive values are expressed as percentages of the 
value at some base period, to form index numbers.
Sometimes, however, the quantity cannot be defined
by a single measure, but is (mathematically at least)
a somewhat vague and nebulous idea, like the ‘price
level’. The prices of the things we buy vary from
time to time, but in different ways. Some prices rise
and fall together, but to different degrees; some prices
rise while others fall. For example, between 1920
and 1938 articles like motor-cars had a pronounced
downward trend in price owing to changes in the
methods of manufacture, whereas the price of coal
did not change nearly as much. A good cotton crop
may result in a low price for cotton in the same year
that a poor harvest results in a rise in wheat prices.
Behind all these various movements, however,
economists see a movement in the general price level
due to common factors that affect the prices of most,
if not all, goods in somewhat the same way: factors
such as war and money policy. One way of measuring
this movement is to calculate index numbers for the
separate commodities and then to average them in
one way or another. The changes in the index
numbers for any one commodity are due to the
combined effects of the changes in the general level
of prices and the special changes for the commodity;
and in the process of averaging for many commodities
the special effects tend to cancel out, leaving as the
dominating factor in the combined index the changes
in general level.
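
The averaging of separate index numbers can be sketched as follows. This is an illustration only: the four commodities and their prices are hypothetical, chosen so that a general rise of ten per cent is overlaid by special movements in wheat and cotton, and the combined index is taken as a simple unweighted arithmetic mean, which is just one of the "one way or another" mentioned above.

```python
# Hypothetical prices (not from the text) at the base period and a later period.
base_prices = {"wheat": 40.0, "cotton": 25.0, "coal": 20.0, "steel": 50.0}
later_prices = {"wheat": 48.0, "cotton": 25.0, "coal": 22.0, "steel": 55.0}

# Index number for each commodity: later price as a percentage of base price.
index_numbers = {
    name: 100.0 * later_prices[name] / base_prices[name]
    for name in base_prices
}

# Combined index: unweighted arithmetic mean of the separate index numbers.
combined_index = sum(index_numbers.values()) / len(index_numbers)

for name, value in index_numbers.items():
    print(f"{name:7s} {value:6.1f}")
print(f"combined {combined_index:5.1f}")
```

In this made-up case the special rise in wheat and the special fall in cotton (relative to the general movement) offset one another, and the combined index shows the general rise alone.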

The Board of Trade, the Economist and the Statist 
indexes of wholesale prices in Great Britain are 
obtained in this way. The Ministry of Labour
cost-of-living index, already mentioned, is based on retail
prices and is obtained in a slightly different way.
Broadly, these indexes show much the same
fluctuations, but the fluctuations differ in detail; and the
economic statistician who knows about the detail can 
appreciate the economic significance of the differences, 
and suggest which index is most suitable for any 
given purpose. There are also other index numbers 
measuring changes in such quantities as wages, pro- 
duction, industrial activity, and prices of industrial 
shares. Indexes may also be used for making inter- 
national comparisons of quantities like real wages. 

Let us now consider the limitations of the average. 
Professor A. L. Bowley has written: ‘Of itself an 
arithmetical average is more likely to conceal than to 
disclose important facts; it is of the nature of an 
abbreviation, and is often an excuse for laziness.’
The average does not measure the important facts
that arise from the variation. In dealing with human
problems such as nutrition, for example, it is as
important to consider the individuals at the extremes
as the average. It is no consolation to the man who
can only spend, say, 4s. per week on food that is not
sufficient for health, to know that the average ex- 
penditure is 9s. per head per week. 

We have seen that for the domestic consumer the 
average life of electric lamps has significance, but 
some large consumers such as public lighting auth- 
orities do not replace lamps as they burn out; they 
find it more economical to renew all lamps periodically, 
whether burnt out or not, and for them variability in
life is important. Suppose such an authority decided
that it would renew lamps at such intervals that only
4 per cent burn out before renewal (it would be very
expensive to renew lamps so frequently that none
were burnt out). Then for the lamps in Table 5
(p. 41), the renewals would be made after 600 hours
of burning; for 6 lamps (= 4 per cent of 150) have
lives shorter than this. For this consumer the 
effective life of each lamp would thus be only 600 
hours, and not the average life of 1,452 hours. Had 
the lamps been less variable, the effective life would 
have been nearer the average life. 
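
The rule used by the lighting authority can be written out as a short calculation. The sketch below is in Python and is an illustration only: the 25 lamp lives are hypothetical (they are not the 150 lives of Table 5), and the function name `renewal_interval` is invented for the purpose.

```python
from statistics import mean

# Hypothetical lives, in hours, of 25 lamps (not the data of Table 5).
lamp_lives = [
    520, 780, 910, 1040, 1100, 1180, 1230, 1290, 1330, 1370,
    1400, 1430, 1460, 1490, 1520, 1550, 1590, 1630, 1680, 1730,
    1790, 1860, 1950, 2080, 2370,
]


def renewal_interval(lives, allowed_percent):
    """Longest interval at which no more than `allowed_percent` per cent
    of the lamps have already burnt out (assumes allowed_percent < 100)."""
    ordered = sorted(lives)
    allowed_failures = len(ordered) * allowed_percent // 100
    # The lamp at this position is the first that must still be burning.
    return ordered[allowed_failures]


interval = renewal_interval(lamp_lives, allowed_percent=4)
burnt_out_early = sum(1 for life in lamp_lives if life < interval)

print("average life:", mean(lamp_lives))
print("renewal interval:", interval)
print("lamps burnt out before renewal:", burnt_out_early)
```

With these made-up lives one lamp in 25 (4 per cent) fails before renewal, and the effective life is well below the average life, as in the example of the text.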

Instances in which variation is of practical
importance can be found in all fields: the strength of a
chain is the strength of its weakest link, not that of
the average link; owing to variations in the strength
of his materials and in the load a structure will have
to bear, the engineer designs the structure with a
‘factor of safety’; the banker keeps a reserve of cash
in the till to cope with variations in the demand for
money; the authority that supplies water allows for
variations from time to time in the rainfall when
deciding on the capacity of its reservoirs; the
electricity supply authority has to cater for a ‘peak’ load
which, owing to variations in demand, is greater than 
the average load; and so on. 

The average of a frequency distribution like that 
shown in Figure 11 (p. 42), has the merit that it is
near the typical value, but when the distribution is
like Figure 12 or 13, the average is not even typical.
The average income of the sur-tax payers of Figure 12
is about £5,000; the typical income is right at the
lower end of the scale, between £2,000 and £3,000. 
The average age at death of the people who died in 
1938 (Figure 13) is 58 years, but such a statement is 
a very inadequate description of the distribution with 
its concentrations of deaths at ages under one year, 
and in the neighbourhood of 70 to 75 years. 

When data are in the form of a time series and 
averages are taken over a long period of time, they are
apt to conceal important changes in trend. For
example, Table 12, which was obtained by averaging
monthly figures of unemployment, suggests an
improvement in the situation, continuing up to the
end of 1937; but on referring to Figure 3 (p. 5?)
it will be seen that the percentage unemployment
began to increase well before the end of 1937; the
average figures conceal this fact.

Table 12

Percentage of Insured Workers Unemployed: Averages for Six-Monthly Periods

Period            Percentage Unemployed
Jan.-June 1936    14.2
July-Dec. 1936    12.1
Jan.-June 1937    11.2
July-Dec. 1937    10.5

Even for studying variations, however, the average
can be of great use, for we can divide the whole field of
investigation into sections and find separate averages
for them. Table 1 (p. 23) gives the
death rates for the several age groups and shows the
variation with age; the figures of [...]
represented in Figures 1 and 2 (pp. 27, 29) [...]
for the income groups, and show the variation in
consumption with income.

Variation has an important effect on the average
itself if that average is a weighted one. The average
death rate in England and Wales in 1938 was 11.6
per thousand, but this value is not obtained by adding
up the nine values given in the top part of Table 1 for
nine separate age groups and dividing by nine; such
a calculation gives a rate of 34.5 per thousand. The
rate of 11.6 results if, when combining the separate
death rates, each is given a weight proportional to the
number of people living in the age group, and could
be calculated arithmetically by multiplying the death
rates by the corresponding numbers of people, adding,
and dividing by the total number of people. The
important point to notice is that such a weighted
average depends not only on the quantities averaged,
but also on the weights with which they are combined;
and a change in weights may result in a change of
average quite different from that which would result
from a change in the quantities alone. I illustrate
this from the following data.

The Registrar General divides Wales into two
districts: Wales I containing the industrialized counties
of the south, viz.: Brecknockshire, Carmarthenshire,
Glamorganshire, and Monmouthshire; and Wales II
containing the remaining counties which are not as a
whole so industrialized. The death rates for 1938
were 12.4 for Wales I and 14.1 for Wales II. In
Table 13 I have separated the death rates according
to age-groups sufficiently finely divided for present
purposes, and we see that in each group, the rate is
lower in Wales II than in Wales I except for the one
group in which the rates are equal, quite the opposite
result from that of the crude total death rates. When
we look at the age distributions of the populations in
Table 13 we see the reason for this apparent
discrepancy. Wales II contains relatively fewer of the
younger men and women than Wales I and more
people over 55. The death rate in Wales II is relatively
high, not because that part of the Principality is more

Table 13

          Death Rate per 1,000        Percentage Number Living
Age       Wales I      Wales II       Wales I      Wales II
0-5       16.5         15.?           7.3          [illegible]
5-15      1.5          1.5            17.6         14.9
15-35     3.3          [illegible]    32.2         3?.?
35-55     7.?          7.0            26.6         2?.2
55-65     23.3         [illegible]    9.4          [illegible]
65-75     60.0         40.1           5.1          [illegible]
75-       150.3        145.6          1.8          3.2
Total     12.4         14.1           100.0        100.0


[...] obtain [...] two districts [...] same population [...].
Let us use the [population of] Wales I to give the
weights. Then the weighted mean death rate for
Wales II corrected to the age distribution of the
population of Wales I is ([...] etc.) ÷ 100.0 = 10.9,
substantially lower than the rate in Wales I. This process
of correcting a death rate to a standard age distribution
is called standardisation. In the [...]
statistics [...] death rates are also
standardised for the sex composition of the population,
and the standard population used is one having the 
same age and sex distribution as the population of the 
whole of England and Wales in 1901. 
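
The effect of the weights, and the process of standardisation, can be illustrated with a small calculation. The sketch below is in Python; the two districts, their three age groups and all the figures are hypothetical (they are not those of Table 13), and the function name `weighted_mean` is illustrative.

```python
def weighted_mean(values, weights):
    """Weighted arithmetic mean: sum(value * weight) / sum(weights)."""
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)


# Hypothetical death rates per 1,000 in three age groups (young, middle, old).
rates_a = [2.0, 10.0, 80.0]
rates_b = [1.8, 9.0, 75.0]   # lower than district A in every age group

# Hypothetical percentages of each population living in the three age groups.
population_a = [50, 40, 10]
population_b = [35, 40, 25]  # district B has relatively more old people

crude_a = weighted_mean(rates_a, population_a)
crude_b = weighted_mean(rates_b, population_b)

# Standardised rate for B: B's own rates combined with A's age distribution.
standardised_b = weighted_mean(rates_b, population_a)

print(f"crude rate, district A:        {crude_a:.1f}")
print(f"crude rate, district B:        {crude_b:.1f}")
print(f"standardised rate, district B: {standardised_b:.1f}")
```

In this made-up case the crude rate is higher in district B although its rate is lower in every age group; once both districts are given the same weights the comparison is reversed, which is the kind of result described for Wales I and Wales II.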

After the average of a series of figures, the next
thing we usually consider is the amount of variation,
ignoring for the time its form. One measure is the
range, which is the difference between the lowest and
highest values in the series, and I use it later in this
book because it is easy for the beginner to appreciate.
It is not favoured by statisticians, however, except in
limited circumstances, partly because it uses only the
extreme values in the series and is unaffected by the
spread of the values in between. The variation as a
whole may be disclosed by measuring the individual
values as differences from their average, and may be
summarized by averaging these differences (taken
without regard to sign), thus obtaining a quantity
known as the mean deviation.

Another measure, called the variance, is obtained by
squaring and averaging these differences, and the
square root of the variance is the standard deviation.
There are also other, less important measures of
variation. The reason for having so many measures
and for preferring one to another need not concern us
here. It is sufficient to note that the measures are
roughly equivalent; for example, they would all show
a lower value for distribution (2) in Figure 14a (p. 47)
than for distribution (1).
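
The four measures just described can be computed side by side. The following Python sketch uses eight hypothetical values chosen so that the arithmetic comes out exactly; the variance is taken, as in the text, by averaging the squared differences over all the values (the `pvariance` and `pstdev` functions of the standard library do this).

```python
from statistics import mean, pstdev, pvariance

values = [2, 4, 4, 4, 5, 5, 7, 9]  # hypothetical data

average = mean(values)

# Range: difference between the highest and lowest values.
value_range = max(values) - min(values)

# Mean deviation: average of the differences from the average,
# taken without regard to sign.
mean_deviation = mean([abs(v - average) for v in values])

# Variance: average of the squared differences from the average.
variance = pvariance(values)

# Standard deviation: square root of the variance.
standard_deviation = pstdev(values)

print(average, value_range, mean_deviation, variance, standard_deviation)
```

For these values the average is 5, the range 7, the mean deviation 1.5, the variance 4 and the standard deviation 2.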

Measures of variation suffer from the same general
limitation as the average in that they ignore something:
the form of the variation; but since it is not often
that we need to compare distributions or series of data
for which this form differs much, the limitation has
less effect on the usefulness of the measures of degree
of variation than on the usefulness of averages.
Indeed the averages and the measures of variation
together cover most of the needs of the practical
statistician, but their interpretation and use in
combination require a good knowledge of statistical theory.

A development of great importance in applied
statistics has taken place during the past decade or so,
and results from a recognition of the fact that variation
is a composite quantity, resulting from the combined
effects of a multitude of factors. The combined
variation can be broken down into parts associated with
groups of these factors and the relative importance of
these groups as sources of variation thus be measured.
This process, which belongs to a fairly [...]
of statistics, is parallel to that of breaking down an
average death rate, say, into sub-averages for several
age groups (Table 1, p. 23).

There are quantities for measuring the form of
variation and formulae including these have been
devised for describing the general character of almost
all the shapes of frequency distribution that are met
with. It is perhaps a consequence of the uniformity
of nature and a sign of the [...]
in condensing and summarizing data that with four
constants including the average and standard deviation,
all the essential characteristics of most frequency
distributions can be described: that these four constants
can contain all the essential data from observations on
hundreds of individuals. The measures of form of
variation are difficult to interpret in practical work,
however, and they are mostly of value in the
development of the statistical theory on which practical
statistical methods are based.

There are measures of association and correlation,
but they require experience and a knowledge of
statistical theory for their full appreciation. The most
important is the correlation coefficient which measures
the degree of correlation. If there is no correlation
the coefficient is zero; and if there is a perfect
correspondence such that changes in one character are
exactly related to changes in another, the coefficient is
plus or minus 1, the sign merely denoting whether the
correlation is positive or negative. A value between
0 and plus or minus 1 describes a degree of correlation
between these two extremes; the higher the value of
the numeral in the coefficient, the greater is the degree
of correlation. Measures of association and correlation
are important in practical statistical analysis and
with their aid conclusions can be reached from
statistical data, that would otherwise be missed.
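
The behaviour of the coefficient at its extremes can be seen in a small example. The text gives no formula, so the sketch below assumes the usual product-moment definition of the correlation coefficient; the function name and the data are hypothetical.

```python
import math


def correlation_coefficient(xs, ys):
    """Product-moment correlation coefficient of two equally long series."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sum_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sum_xx = sum((x - mean_x) ** 2 for x in xs)
    sum_yy = sum((y - mean_y) ** 2 for y in ys)
    return sum_xy / math.sqrt(sum_xx * sum_yy)


x = [1, 2, 3, 4, 5]
rises_with_x = [2, 4, 6, 8, 10]   # exactly related to x, rising
falls_with_x = [10, 8, 6, 4, 2]   # exactly related to x, falling
unrelated = [2, 1, 3, 1, 2]       # no linear relation to x

r_positive = correlation_coefficient(x, rises_with_x)
r_negative = correlation_coefficient(x, falls_with_x)
r_none = correlation_coefficient(x, unrelated)

print(r_positive, r_negative, round(r_none, 6))
```

The three series give coefficients of plus 1, minus 1 and (apart from rounding in the arithmetic) zero respectively.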

Rates, ratios and percentages; index numbers;
averages, weighted and unweighted; measures of
variation; measures of association and correlation;
these are among the most important tools of the
statistician. Each of these describes some important
feature of the data, each leaves much undescribed;
each has its uses and its limitations. These quantities
should be used carefully, as they are so easy to misuse,
and it is perhaps advisable to leave their use mostly
to the expert. But anyone may need to understand
information expressed by them, and it is well that
everyone should know at least something of their
meaning.


CHAPTER VI 


SAMPLING 

The practice of taking a small part of a large bulk to 
represent the whole is fairly generally understood and 
widely used. The housewife will ‘sample’ a piece of 
cheese at the shop before making a purchase; and a 
cotton spinner will buy a bale of cotton, having seen 
only a small sample of it. The sample is also a very 
important tool of the statistician. 

There are two general reasons for working with 
samples instead of the bulk. (1) Some appraisals of
the thing in question involve destructive tests, and
there is no point in appraising it if the whole is
destroyed in the process; the housewife cannot eat her
cheese and have it. (2) It is very much more
economical to investigate a sample than the whole
bulk. In social and economic work, for example, it is
usually prohibitively expensive to investigate the whole
field of inquiry in any detail. Even the population 
census, which has behind it the financial and coercive 
resources of the state, is made only at infrequent 
intervals (the Minister of Health refused to hold a 
quinquennial Census in 1936 partly on the grounds of 
expense), and the questions asked are few and com- 
paratively simple. If a sample inquiry is made, on 
the other hand, it is feasible to employ experienced 
field workers who can collect information that is 
comparatively detailed and elaborate, and can ensure 
that the records are reasonably accurate. 

An important example of a sample inquiry is the 
Ministry of Labour’s 1937-38 investigation of
working-class family budgets. It was desired, largely for the
purpose of constructing a new cost-of-living index
number, to know how, on the average, working-class
families spend their wages. It is inconceivable that
all such families in the country could, or would, supply
the necessary detailed accounts of their expenditure,
so a sample of about 9,000 families was selected, and
these were induced to keep very full accounts for four
chosen weeks in 1937 and 1938. By working on this 
scale it was possible for investigators to visit the 
families and help them to keep their accounts in a way 
that was more uniform and much more accurate than
would otherwise have been possible. As a result, the
average weekly expenditure was obtained on about 90
separate items of food, clothing, fuel and light, and
other items such as soap and cigarettes, holidays and
hairdressing, doctors and dentists, and so on. A
more unusual example of the use of sampling is given
by Sidney and Beatrice Webb who, during their
investigation of English local government, examined
all the local Acts in a few selected years as a sample of
the thousands of Acts passed between 1689 and 1834.

Unfortunately, however, the method of inquiry by
sample is somewhat mistrusted, sometimes honestly
and sometimes, I suspect, because a sample has in
some instance given a result the sceptic does not like.
Yet a sample may give reliable results. For example,
an earthquake disaster in 1923 interrupted the
tabulation of the results of the Japanese census of 1920 and
interim figures were given based on a sample containing
one family in every thousand. These results agreed
well with those given later when the regular tabulations
were completed. Nevertheless it must be agreed that
samples do not represent the bulk exactly, and that
they may sometimes be much in error.

Mr. Seebohm Rowntree’s 1941 social survey of
York was made by visiting all households under in- 
vestigation, but in his book Poverty and Progress, 
Mr. Rowntree also gives for comparison the results 
obtained from samples taken from the full returns. 
He makes no comment on these comparisons (the 
differences are substantially what a statistician would 
expect), but a journalist commented on the figures as 
follows: ‘Broadly speaking, they suggest that
“sample-results” are usually within 15 per cent of the truth
either way.’ This statement is too broad to have a
precise meaning, but in any event it is not the kind of 
generalization a statistician would make, for he knows 
that the error in a sample result depends on the size 
of the sample, on the nature of the bulk being sampled 
(particularly on the variation within it) and on the 
way in which the sample is taken. It is my purpose 
in this chapter to show how this comes about. 

In the discussion I shall follow the usual practice 
of statisticians of referring to the bulk that is being 
sampled as the population. The population in this 
chapter is to be thought of specially as contrasting 
with the sample. I shall refer only to populations 
consisting of recognizably discrete individuals, e.g. 
men or electric lamps. 

The ideal sample is the simple random one in which 
chance alone decides which of the individuals in the 
population are chosen. Suppose we wish to obtain
a random sample of the people of England and Wales
in order to make an estimate of their average height.
To do this we may, in principle, take forty-odd
million exactly similar cards, one for each person, and
write each person’s national registration number on
the appropriate card. These cards may then be put
in a large churn, thoroughly mixed, and (say) one
thousand cards be drawn, somewhat in the way the
names are drawn for the Irish sweepstake. The
thousand people whose numbers are on the cards are
a random sample, and we can measure their heights,
find the average, and so obtain a figure which is an 
estimate of the average height for the population. 

To investigate the error in the average so estimated
we could, again in principle, subsequently measure the
heights of all individuals in the population and so
obtain the true average. An easier thing to do is to
draw a number of samples, each of one thousand, and
calculate the several averages. These will vary above
and below the true, or population value, and the extent
to which they vary gives some idea of the error with
which any one sample estimates the true average.

To do such an experiment in fact requires far
greater resources than I can command, but there are
other experiments that are similar in principle and are
easier to do. What we really want to know is how
chance works in deciding the choice of the sample,
and chance also operates in games of the table, with
such things as cards, dice and roulette wheels. In
those games, a population does not exist in the sense
that the population of England and Wales does, but
we may use the concept of a hypothetical population.
Suppose, for example, we threw a perfectly balanced
six-sided die millions of times. We should expect
one-sixth of the throws to score aces, one-sixth to
score twos, and so on, and the average score would be
⅙(1 + 2 + 3 + 4 + 5 + 6) = 3½. These millions of throws
are a population, and any thousand of them including
the first thousand is a random sample. But the
millions of throws need not, in fact, be made; they
need only be imagined as a hypothetical population, of
which any number of actual throws form a sample.
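
The hypothetical population of die throws lends itself to a small computer experiment. The sketch below, in Python, is an illustration and not the author's own experiment: the pseudo-random number generator stands in for a perfectly balanced die, and the seed is arbitrary, chosen only so that the run can be repeated.

```python
import random
from fractions import Fraction

faces = [1, 2, 3, 4, 5, 6]

# Average score of the hypothetical population: each face in one-sixth
# of the throws.
population_average = Fraction(sum(faces), len(faces))

# A sample of one thousand actual (simulated) throws.
rng = random.Random(1)  # arbitrary seed
throws = [rng.choice(faces) for _ in range(1000)]
sample_average = sum(throws) / len(throws)

print("population average:", float(population_average))
print("average of 1,000 throws:", sample_average)
```

The sample average will not in general be exactly 3½, but with a thousand throws it should lie close to it.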

To illustrate the way in which random sampling
errors arise I have made an experiment which I need
not describe in exact detail. The experiment is
equivalent to that described here, which is not quite
so easy to perform but easier to imagine. The
imagined apparatus consists of ten packs, each of
ten cards, the cards in each pack being numbered
respectively 1, 2, 3 ... 10. The packs are shuffled
separately, one card is drawn from each, and the ten
numbers on the cards are added to give a score. For
example, the numbers might be 2, 4, 2, [...], 9, [...],
8 and the score would then be 53. Then the cards
are put back in their packs, the packs are reshuffled,
and again ten cards are drawn to give another score.
This is repeated, so that a large number of scores
results, which are individuals from a hypothetical
population consisting of the very large number of
scores that could conceivably be obtained. The
lowest conceivable score is 10, resulting from ten aces;
the highest is 100, resulting from ten tens; and the
true average score is 55. Now let us consider the
results of the experiment.
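
The imagined apparatus is easily imitated on a computer. The following Python sketch is an assumption-laden stand-in for the experiment, not a record of it: drawing one card from a freshly shuffled pack of ten is replaced by choosing a whole number from 1 to 10 at random, the function name `draw_score` is illustrative, and the seed is arbitrary.

```python
import random


def draw_score(rng):
    """One score: one card (numbered 1 to 10) drawn from each of ten
    separately shuffled packs, the ten numbers being added."""
    return sum(rng.randint(1, 10) for _ in range(10))


rng = random.Random(2024)  # arbitrary seed, so that the run can be repeated
scores = [draw_score(rng) for _ in range(10_000)]

print("lowest score: ", min(scores))
print("highest score:", max(scores))
print("average score:", sum(scores) / len(scores))
```

Every score must lie between 10 and 100, and the average of a large number of scores should be near the true average of 55.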

Table 14

Individual Scores and Average Scores in Samples of Ten and Forty

Individual Scores

52  46  72  53  36  55  42  56  61  53
56  65  48  54  62  65  48  65  61  60
58  42  58  46  63  61  68  53  54  43

Averages of Samples of Ten

52.6  58.4  54.6  52.6  48.6  54.0  52.8  50.8  46.0  55.8
53.4  59.4  55.0  56.2  61.6  53.6  54.2  56.8  52.3  54.0
56.7  55.2  56.3  52.3  53.8  57.8  55.9  61.8  58.6  49.2

Averages of Samples of Forty

54.6  51.6  53.6  56.6  54.3  55.1  57.3  54.4  55.4  [illegible]
55.3  54.1  55.8  55.4  5?.0  53.2  55.?  54.3  54.8  54.2
54.3  57.2  53.2  56.0  54.5  51.5  53.7  56.0  54.8  55.4

It would take too much space to give in full the
results of a really extensive experiment, but some
are given in Table 14 to show the kind of thing that
happens. The top part of the table gives the first
thirty individual scores. Chance has not given a
score as high as 100 or as low as 10, as it might
have done, and presumably would have done had I
continued long enough with the experiment. The
first thirty scores vary between 36 and 72, the range
being 36. Now, in order to see what happens when
we take samples and find the averages, I took 30
samples, each of ten scores. Such samples are far
too small for most statistical inquiries (although
statisticians sometimes have to be content with small
samples) but they illustrate the errors of random
sampling. The average scores are in the middle
section of Table 14. The first average of 52.6 is
obtained from the ten individual scores in the top
row of the table. The thirty averages vary between
46.0 and 61.8, the range being 15.8, and no average
differs from the population value of 55 by more than
9.0. In so far as these thirty samples show the
variations we are likely to get in the averages of the
millions of samples we could draw, we may say that
the biggest error with which the average of any one
sample of ten scores estimates the population average
is 9.0. When I took larger samples, each of forty
scores, I obtained results given in the lowest section
of Table 14. They vary between 51.5 and 57.3 with
a range of 5.8, and the biggest error with which any
one sample of forty scores estimates the population
average is 55 - 51.5 = 3.5. Thus we see that the
averages estimated from random samples vary among
themselves and differ from the average for the
population, but that the biggest error decreases as the size
of the sample is increased from ten to forty; and you
may take on trust that this tendency would have
continued had I extended the experiment to deal with
still larger samples. For example, by calculating the
average of the thirty averages of samples of forty,
we have the average of a single sample of 1,200 scores,
which comes to 54.8, very close to the population
value of 55.

These results are shown in the frequency distribu- 
tions of Figure 16 where, instead of a frequency for 
each sub-range, there are dots, each dot representing 
an individual score or the average of a sample. Notice 
how the averages tend to be clustered more closely
round the population value as the size of the sample
is increased. A frequency distribution of sample
averages for any given size of sample is called the
sampling distribution of the average.
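
The closer clustering of the averages in the larger samples can be reproduced by simulation. The Python sketch below is an illustration under stated assumptions, not the experiment of the text: scores are generated by random whole numbers from 1 to 10 in place of cards, 1,000 samples of each size are drawn instead of thirty, the seed is arbitrary, and the spread of each sampling distribution is summarized by its standard deviation.

```python
import random
from statistics import pstdev


def draw_score(rng):
    """Sum of ten cards, one drawn at random from each of ten packs."""
    return sum(rng.randint(1, 10) for _ in range(10))


def sample_averages(rng, sample_size, number_of_samples):
    """Averages of `number_of_samples` random samples of `sample_size` scores."""
    return [
        sum(draw_score(rng) for _ in range(sample_size)) / sample_size
        for _ in range(number_of_samples)
    ]


rng = random.Random(7)  # arbitrary seed, only to make the run repeatable
spread = {}
for size in (10, 40):
    averages = sample_averages(rng, size, number_of_samples=1000)
    # Standard deviation of the sample averages: a measure of how closely
    # they cluster round the population value of 55.
    spread[size] = pstdev(averages)
    print(size, min(averages), max(averages), round(spread[size], 2))
```

The averages of samples of forty should vary over a narrower range than those of samples of ten, with a spread of roughly half the size.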

The errors of random sampling, which in an
experiment like that just described show themselves as
variations between sample means, arise from the
variation between the individuals in the original
population. Other things being equal, such sampling
errors are proportional to the amount of variation in
the population. As an extreme example, it is easy to
see that had there been no variation between the
individual scores and they had all been 55, the means
of all samples of all sizes would have been 55 and there
would have been no sampling errors.

[Figure 16. Horizontal axis: Score.]

When the statistician thinks of the random error of 
the average of a sample he thinks of a whole collection 
of possible values of error, any one of which the given 
sample may have: of the sampling distribution of 
errors. The actual error of the given sample probably 
exceeds the smallest of these values ; it may easily 
exceed the intermediate values; and it is unlikely to 
exceed the very largest values. There is a whole list 
of probabilities with which the various values of error 
are likely to be exceeded, and these can be calculated
from a quantity called the standard error. The
standard error is a measure of the variation in the
sampling distribution analogous to the standard
deviation (p. 74) and for the statistician it sums up
the whole distribution of errors. If the standard
error of a sample is large, the errors to which that
sample is liable are, as a whole, large; if the standard
error is small, the likely errors are small. This
quantity, carrying with it the idea of errors occurring
with various probabilities, should replace the cruder
‘biggest error’ I introduced in describing the results
of the experiment.

It is not usually necessary to do an actual experiment
to measure sampling errors, as the theory
of probability enables statisticians to deduce sampling
distributions and standard errors theoretically. This
method is better because it is less laborious and more
exact, giving results as accurate as an experiment
involving millions of samples. The results of the
theoretical calculations are of the same kind as those
given by the experiment, and in some instances they
have been checked by very large-scale experiments.

I have considered only the sampling errors of the
average, but the same principles apply to other
statistical quantities such as ratios, and the measures
of variation and correlation. The theoretical deduction
of sampling distributions of the many statistical
quantities in use is a very highly developed branch of
mathematical statistics; and sometimes the problems
have proved so difficult to solve that statisticians have
had to fall back upon actual sampling experiments.

With the ability to calculate errors of sampling,
statisticians can make allowances for them when
making deductions from sample results. It is
standard procedure to examine the results of a sample
to see how far they can be explained by random errors.
This is called testing the significance of the results, and
only such results as cannot reasonably be attributed to
errors of random sampling are held to be statistically
significant.

Before going on to the more practical problems of
sampling, I will summarize the ground covered so far.
When many samples of the same size are taken from
a population of variable individuals, the sample
averages show variation which may be described by a
sampling distribution and measured by the standard
error. A given sample of that size may have any one
of the averages in the distribution, and the probability
that its error will exceed any stated value can be
calculated from the standard error. The standard
error of the average is a measure of the errors to which
a sample average is liable. For a sample of given size,
this standard error increases as the variation between
the individuals in the original population increases;
for a given population, the standard error becomes
smaller as the size of the sample is increased. (For
the sake of the mathematically minded it may be
stated that the standard error is inversely proportional
to the square root of the number in the sample.)
Consequently the random errors can be made as small
as we please by making the sample large enough, and
for a given population it is possible to calculate the
size of sample necessary to reduce these errors to any
desired value. Similar remarks apply to quantities
other than the average.
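
For the mathematically minded, the rule in the summary
above can be sketched in Python. The sketch assumes
the usual formula, standard error = population standard
deviation divided by the square root of the sample size,
for samples drawn with replacement; the population of
scores and all the function names are hypothetical.

```python
import math
import random
import statistics


def standard_error(population_sd: float, n: int) -> float:
    """Standard error of the average of a random sample of n individuals."""
    return population_sd / math.sqrt(n)


def sample_size_for(population_sd: float, target_se: float) -> int:
    """Smallest sample size whose standard error does not exceed target_se."""
    return math.ceil((population_sd / target_se) ** 2)


def simulated_standard_error(population, n, trials=2000, seed=1):
    """Spread (standard deviation) of the averages of many random samples."""
    rng = random.Random(seed)
    means = [statistics.fmean(rng.choices(population, k=n)) for _ in range(trials)]
    return statistics.pstdev(means)


# Hypothetical population of scores; any variable population would do.
scores = list(range(30, 81))
sd = statistics.pstdev(scores)

for n in (25, 100, 400):
    formula = standard_error(sd, n)
    experiment = simulated_standard_error(scores, n)
    print(n, round(formula, 2), round(experiment, 2))

print("sample size for a standard error of 1:", sample_size_for(sd, 1.0))
```

Each fourfold increase in the size of the sample should
roughly halve the standard error, both by the formula
and in the simulated sampling experiment.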

The tendency for large samples from some population
to have averages that vary little amongst themselves
and differ but little from the population value is the
reality behind the popular conception of the Law of
Averages. This law does not operate, as some people
think, so that an abnormally high individual score or
run of scores is followed by an abnormally low score
or run, correcting the average by compensation. In
a random series, the scores following an abnormal
score or run are quite unaffected by what has gone
before; they tend to be nearer the general average than
the abnormal scores are, i.e. to be more normal, so
that when included in the average they reduce the
effect of the abnormal scores. Averaging has more of
a swamping than a compensating effect. Thus if we
may regard the days of weather as individuals from a
population, the average weather for the population
being the general type experienced at a given time of
the year and place, the law of averages does not
require that a very wet spell shall be followed by a
very dry spell. For all I know, there may be a law
to that effect, but if so, it is not the law of averages.
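
The difference between swamping and compensating can
be tried out with a short Python sketch. It uses
simulated throws of a fair die as the random series,
which is a choice made here for illustration; the
function names are hypothetical.

```python
import random
import statistics


def mean_after_high_run(throws, run_length=2, high=6):
    """Average of the throws that immediately follow a run of `high` scores."""
    followers = [
        throws[i]
        for i in range(run_length, len(throws))
        if all(t == high for t in throws[i - run_length:i])
    ]
    return statistics.fmean(followers)


def swamped_average(run_length, later_throws, high=6, typical=3.5):
    """Average of a high run followed by later throws of typical size."""
    total = run_length * high + later_throws * typical
    return total / (run_length + later_throws)


rng = random.Random(0)
throws = [rng.randint(1, 6) for _ in range(200_000)]

print("average of all throws:       ", round(statistics.fmean(throws), 2))
print("average just after two sixes:", round(mean_after_high_run(throws), 2))
for later in (0, 7, 97):
    print(later, "later throws -> running average", swamped_average(3, later))
```

The throws that follow a run of sixes average much the
same as all throws, neither higher nor lower; the
running average comes back towards the general level
only because the run is outnumbered.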

If the individuals in a statistical population are well
mixed up, no known method of investigation can give
more accurate results for a given cost than the method
of purely random sampling just described, unless
something is known about the individuals to enable
some sort of selection to be made. Sometimes, however,
a more complex form of random sample called
the ‘representative sample’ gives greater accuracy.
Suppose, for example, that in a housing survey we
wish to find the average number of rooms per family
in some town. Some families at one end of the scale
of wealth will live in one room each and at the other
end there may be families that have, say, twelve rooms
each; and this variation over a range of eleven rooms
per family will give rise to a certain standard error in
a simple random sample. Suppose further that we
can divide the town into three districts (‘poor’,
‘middle-class’, and ‘wealthy’) in each of which the
total number of families is known, and that the range
of variation of rooms per family is from one to seven
in the poor, from four to ten in the middle-class, and
from six to twelve in the wealthy district. Then if we
take a random sample from any one district, the district
average is estimated with a smaller standard error than
that just mentioned, resulting from a range of variation
of six rooms per family (i.e. 7 minus 1, 10 minus 4, or
12 minus 6). Further, it can be proved that if a
representative sample of the same size is taken, in which
the proportion of families from each district is the
same as in the whole town, the standard error of the
average of that sample will be the same as the smaller
error resulting from a range of variation of six rooms
per family. This is because the proportion of families
from each district is left to chance in the simple random
sample; in the representative sample it is not, and
that source of error is removed.

Random sampling is the basis of the representative
sample, however, which is nothing more than a
weighted combination of random sub-samples.
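
The housing example can be tried as a sampling
experiment in Python. The sketch below is only an
illustration under invented conditions: the numbers of
families in the three districts, and the assumption that
every number of rooms within a district's range is
equally common, are hypothetical and are not taken from
any survey.

```python
import random
import statistics

# Hypothetical town: district -> (number of families, fewest rooms, most rooms).
DISTRICTS = {
    "poor": (5000, 1, 7),
    "middle-class": (3000, 4, 10),
    "wealthy": (2000, 6, 12),
}


def build_town(rng):
    """Give every family a number of rooms drawn evenly from its district's range."""
    return {
        name: [rng.randint(low, high) for _ in range(size)]
        for name, (size, low, high) in DISTRICTS.items()
    }


def simple_random_mean(everyone, n, rng):
    """Average rooms per family in a simple random sample of n families."""
    return statistics.fmean(rng.sample(everyone, n))


def representative_mean(town, n, rng):
    """Average from random sub-samples taken in proportion to district size."""
    total = sum(len(families) for families in town.values())
    chosen = []
    for families in town.values():
        chosen += rng.sample(families, round(n * len(families) / total))
    return statistics.fmean(chosen)


rng = random.Random(42)
town = build_town(rng)
everyone = [rooms for families in town.values() for rooms in families]

se_simple = statistics.pstdev(
    [simple_random_mean(everyone, 100, rng) for _ in range(1000)]
)
se_representative = statistics.pstdev(
    [representative_mean(town, 100, rng) for _ in range(1000)]
)
print("simple random sample: ", round(se_simple, 3))
print("representative sample:", round(se_representative, 3))
```

Because the sub-samples are taken in proportion to the
sizes of the districts, the plain average of all the
chosen families is already the properly weighted
combination, and its spread over repeated samples should
come out smaller than that of the simple random sample.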

Representative sampling is used in the Gallup polls
of public opinion, where care is taken to see that the
opinions of various classes of people are represented
in appropriate proportions instead of leaving it to
chance to determine what these proportions shall be.

If it is granted that the ideal random sample can be
a reliable instrument of investigation, the questions
remain: Can the ideal be attained? Are the actual
samples that are used as reliable as random samples?
As a random sample is increased in size, it gives a
result that progressively comes closer to the population
value, whereas samples taken in some of the ways
that are used give results that progressively come closer
to some value other than the population value, results
that may for some kinds of samples be too high, for
others too low. A sample of this kind is said to be
biased, and the difference between the value given by
a very large sample and the corresponding population
value is called an error of bias. A biased die, for
example, is one for which the fraction of throws
showing an ace, say, tends to a value other than
one-sixth (the value for the hypothetical population),
and the greater the number of throws, the clearer is
it that the fraction of actual aces is not one-sixth.
Errors of bias are added to the random errors, and
since they follow no laws from which they can be
calculated, they must be eliminated entirely or reduced
so that they become unimportant. This may be
difficult to do, and it is often necessary to use very
elaborate sampling methods to avoid errors of bias.
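
The behaviour of a biased die can be imitated in Python.
In the sketch below the chance of an ace on the biased
die is set at 0.20, a made-up figure chosen only for
illustration; the function name is hypothetical.

```python
import random


def fraction_of_aces(n_throws, p_ace, rng):
    """Fraction of n_throws showing an ace when each throw has chance p_ace."""
    aces = sum(rng.random() < p_ace for _ in range(n_throws))
    return aces / n_throws


rng = random.Random(7)
for n in (100, 10_000, 400_000):
    fair = fraction_of_aces(n, 1 / 6, rng)
    loaded = fraction_of_aces(n, 0.20, rng)  # 0.20 is a made-up degree of bias
    print(f"{n:>7} throws: fair die {fair:.4f}, biased die {loaded:.4f}")
```

With few throws the two dice can hardly be told apart;
with many, the random errors shrink and the fraction for
the biased die settles at a value other than one-sixth.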

It is nearly impossible for anyone to select individuals
at random without some randomizing apparatus. If
a teacher tries to select a few children from a class,
he will tend to choose too many clever ones, or dull
ones, or average ones; or if he tries to be random he
may select too many clever and dull children and too
few intermediate ones. In selecting a sample of
houses ‘at random’, the investigator will be very
unlikely to select anything like the right proportions
of large and small ones, shabby and smart ones, new
and old ones, and so on. Bias almost inevitably will
creep in. This is illustrated by the results of large
experiments conducted on several thousands of school
children in Lanarkshire in 1930 to measure the effect
of feeding them with milk, on their growth during the
period of the experiment, about six months. At each
school the children were divided into two comparable
groups; one group received the milk and the other did
not, and the effect of the milk was to be measured by
comparing the growth rates of the two groups. The
results for a number of schools were combined. In
an experiment of this kind, the accuracy depends very
much on the two groups or samples of children being
similar on the average before the feeding with milk
begins, i.e. on one being unbiased with respect to the
other. To secure this, the children were selected for
the two groups either by ballot or by a system based
on the alphabetical order of the names. Usually,
these are both good ways of making unbiased random
samples of the two groups, but the whole thing was
spoilt by giving the teachers discretionary powers,
where either method gave an undue proportion of
well-fed or ill-nourished children, ‘to substitute others
to obtain a more level selection’. Presumably the
substitution was not done on the basis of the actual
weights of the children, but was left to the personal
judgement of the teachers. The result was that at
the start of the experiment, the children in the group
that were later fed with milk were smaller than those
in the other group, the average difference being an
amount that represented three months of growth. It
has been suggested that teachers tended, perhaps
subconsciously, to allow their natural sympathies to cause
them to put into the ‘milk’ group more of the children
who looked as though they needed nourishment.
This bias did not ruin the experiment, but unfortunately
the interpretation of some of the results was left
somewhat a matter of conjecture instead of relative
certainty, and there was later a certain amount of
controversy about some of the interpretations. The
substitutions of the children could have been done
without introducing bias had the actual weights been
made the basis, and there would have been an
improvement on the purely random sampling; but by
unwittingly introducing the bias, it seems that the
teachers actually made matters worse.

A sampling method that is very liable to give biased 
results, particularly when testing opinion on contro- 
versial matters, is that of accepting voluntary returns. 
An undue proportion of people with strong views one 
way or the other are likely to make the returns, and 
people with moderate views are not so likely to take 
the trouble to represent them. For this reason, the
post-bags of newspapers and Members of Parliament 
do not give random samples of public opinion. 

A spectacular example of a biased sample is provided
by the attempt of the American magazine, the Literary
Digest, to forecast the results of the Presidential
election of 1936 by means of a ‘straw vote’. Some
ten millions of ballot post cards were sent to people
whose names were in telephone directories and lists of
motor-car owners, and several million cards were
returned, each recording a vote for one of the
candidates. Of those votes, only 40.9 per cent were in
favour of President Roosevelt, whereas a few weeks
later in the actual election he actually polled 60.7
per cent of the votes. Those from among telephone
users and motor-car owners who returned voting cards
did not provide a random sample of American public
opinion on this question.

Bias does not result only from obviously bad
sampling methods; it may arise in more subtle ways
when a perfectly satisfactory method is modified
slightly, perhaps because practical conditions make
this necessary. In some Ministry of Labour samples
of the unemployed, a 1 per cent sample was made
by marking every hundredth name in the register of
claims, which was in alphabetical order. Bias was
introduced by not confining the inquiry to the marked
names; instead, the first claimant appearing at the
Exchange whose name was marked or was among the
five names on either side of the marked one, was
interviewed to provide the necessary data. Claimants
who are in receipt of benefit attend at the exchange
several days in a week, whereas those whose claims are
disallowed but who are maintaining registration only
attend once a week. The effect of this and of the
latitude allowed in the choice of persons for interview
was that too many claimants in receipt of benefit were
included in the sample. It was only when the existence
of this bias was realized that some of the results
that were apparently inconsistent with other known
facts made sense. A similar kind of effect can arise
in surveys of households if no one is at home when
the investigator first calls at some house chosen to be
one of the sample. Such houses are likely to contain
small families with few or no young children, since in
large households someone is almost certain to be at
home to answer the door; and unless the houses with
no one at home are re-visited, the sample will be
biased in respect of size and character of household.

Although there is no general theory of errors of bias
by which the amount of such errors can be calculated
in any particular instance, as can be done for random
errors, statisticians do not work entirely in the dark.
Sometimes the sample gives, as part of its results,
information that is also known accurately from a full
census, and the sample is usually regarded as free from
bias in all respects if in this one respect it agrees with
the census. The soundness of the results of a sample
inquiry may sometimes be checked by comparing them
with data obtained in other ways, perhaps by other
investigators. Where none of these checks are available,
it may be necessary to rely on the statistician’s
general experience of sampling methods in deciding
whether the sample in question is a good one. I have
given enough examples to show that a good deal is
known of the ways in which errors of bias arise, and
what must be done to avoid them.

It is implicit in my definition of errors of bias that
they cannot be ‘drowned’ by taking very large samples
in the way that random errors can; a fact that the
experience of the Literary Digest's straw-vote on the
American Presidential election of 1936 amply confirms.
From this point of view, a good sample can be arrived
at only by employing a good sampling method. I have
already mentioned some methods incidentally, and it
is only necessary here to give it as a warning that when
a statistician advises adherence to an elaborate method
with a closeness that may seem to the layman to be
‘fussy’, that advice had better be followed; failure to
do so has been known to lead to biased results.

Altogether, the method of inquiry by sample is
difficult and full of pitfalls. But statisticians could
not get on without it and experience of its use is both
wide and deep, so that in competent hands the method
is capable of giving results that are reasonably accurate.
Moreover, the inevitable errors in the results can be
estimated, and allowance can be made for them in
arriving at conclusions.


CHAPTER VII 

TAKING ACCOUNT OF CHANCE 

Chance operates in many fields besides that of random 
sampling, and many of its effects can be calculated by 
applying the same general methods as are used to 
calculate the errors of random sampling. Some of the 
further applications of those methods will be described 
in the present chapter. 

The effects of chance can be calculated only because
they follow certain laws, but these differ in kind from
the exact laws of subjects like physics. Events that
follow exact laws can be described or predicted
precisely; but we can only specify probabilities that
chance events will occur, or specify limits within
which chance variations will probably lie. Newton’s
laws of motion, for example, are exact because they
describe exactly the relations between the motion of
bodies and the forces acting upon them; the errors of
random sampling follow chance laws because we cannot
predict exactly what average a random sample will
have; we can only state, as I have suggested on
p. 84, the probability that it will lie within certain
limits.

I cannot embark upon a full discussion of what we
mean by chance, but as a preliminary I shall indicate
a few ideas associated with the word. Statisticians
attribute to chance, phenomena (events or variations)
that are not exactly determined, or do not follow
patterns described by known exact laws, or are not the
effects of known causes. That is to say, the domain
of chance varies with our state of knowledge, or
rather of ignorance. Such ignorance may be fundamental
because the relevant exact laws or causes are
unknowable; it may be non-essential or temporary,
and exist because the exact laws do not happen to have
been discovered; or the ignorance may be deliberately
assumed because the known exact laws and causes are
not of such a character that they can profitably be used
in the particular inquiry in hand.

An example of ignorance that, according to present-day
ideas, is fundamental, is in the Principle of
Indeterminacy of modern physics; we do not and cannot
know the precise motion of an electron. We do not
know what determines the position of a shot on a
target, but that ignorance is non-essential and in some
degree temporary. The variations in the positions of
the shots depend on a host of factors such as variations
in the primary aim of the marksman, the steadiness of
his hand, the weight, size, and shape of the bullets,
the propelling charges, the force of the wind, and so
on; but presumably these factors can be investigated
and laws be discovered. Indeed, this has happened,
and the history of gunnery shows the temporary
character of the ignorance. Gunnery is much more
of a science and more exact than it was in the days of
the Battle of Waterloo, or even during the
War; and as knowledge has increased, unpredictable
variations in placing of shots have been reduced; but
at each stage these variations are regarded as due to
chance. Ignorance of causes is assumed by an insurance
company in using its past experience of accident
claims to establish future premiums for motor-car
insurance. The company has considerable knowledge
of the circumstances surrounding every accident on
which a claim is made, but is unable to make more
than limited use of that knowledge, and so treats
accidents largely as chance events, except for a few
special allowances such as ‘no claims bonuses’ or
extra premiums charged to people with bad accident
records.

Usually, events regarded as coming within the
domain of chance are those governed by a complicated
system of many causes, each of which produces only
a small variation; and one frequent characteristic of
such events is that small changes in the circumstances
surrounding them make a big difference to the results.

Chance as I have described it operates in a very wide
field, covering the whole of the unknown; but
mathematical calculations can be made and chance laws be
propounded only for comparatively simple systems
covering a portion of this field. Nevertheless, such
calculations have a wide range of usefulness, which the
following examples will illustrate.

One use of chance calculations is for deciding which
of the fluctuations in a time series are random and
which are trends having some significance. As an
example of a time series, consider the unemployment
data represented in Figure 3 (p. 30). Readers will
have no difficulty in recognizing the broad changes,
viz. the minor waves in 1925 and towards the end of
1928, the large upward sweep in 1930, the improvement
from the end of 1932 to the end of 1937, and the
upward movement again in 1938. For the time being
we shall omit 1926, the year of the General Strike, as
being exceptional. These changes are reasonably
attributed to fundamental causes that operate fairly
slowly and may be represented by a curve
drawn through the actual points of the graph.
There are a number of mathematically determined curves
that have the property of changing in level in such a
regular way, and are of the nature of exact laws.

Imagine such a curve to be drawn through
the points of Figure 3. Then the actual points will
be seen to deviate from this curve. They to
some degree follow a seasonal pattern ...
hard winter, and so on. ... assume ignorance,
... knowledge ... of exact laws
that are relevant, ... deviations to be due
... chance causes that operate we
know not how ... in a complex system ...
chance and sampling ... argument for applying the
theory of errors of random sampling to testing the
statistical significance of fluctuations in time series.
For example, there was a sudden and temporary rise
in unemployment in the beginning of 1936. ...
It actually occurred, and therefore is ...
And so we apply the theory of random errors. If this
theory had been applied to testing the statistical
significance of the sudden rise in unemployment in
1926 it would have shown the operation of something
unusual: we know that to be the General Strike.

This kind of application of the theory enables us,
in retrospect, to decide whether any particular events
with which we try to associate fluctuations have had
important effects compared with the system of random
fluctuations. When following changes week by week
or month by month as they occur it is useful, too, to
be able to decide whether the last increase or decrease
is large enough to call for action, or whether it is
random. At one period, for example, a local newspaper
used to publish weekly figures of deaths due to
road accidents in a certain town, and the number used
to fluctuate about an average of four or five per week.
Should we worry if between two particular weeks the
number rises from three to six, or rejoice if it falls
from five to two? No! Such changes are no greater
than any that can be attributed to chance, and do not
indicate a real change in conditions. Sometimes the
chance coincidence of random fluctuations may give
rise to several consecutive small increases or decreases,
giving a spurious appearance of a trend. Sampling
theory can show when such is the case.
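
The road-accident example can be put to a rough
numerical test in Python. The sketch assumes that the
weekly numbers of deaths behave as independent chance
counts following the Poisson law with an average of 4.5
per week; both the Poisson law and the figure 4.5 are
assumptions introduced for the illustration, and the
function names are hypothetical.

```python
import math


def poisson_pmf(k: int, mean: float) -> float:
    """Chance of exactly k events when the long-run average is `mean`."""
    return math.exp(-mean) * mean**k / math.factorial(k)


def prob_change_at_least(d: int, mean: float, max_count: int = 60) -> float:
    """Chance that two independent weekly counts differ by d or more."""
    pmf = [poisson_pmf(k, mean) for k in range(max_count + 1)]
    return sum(
        pmf[a] * pmf[b]
        for a in range(max_count + 1)
        for b in range(max_count + 1)
        if abs(a - b) >= d
    )


# 4.5 stands for the "four or five per week" of the text.
for d in (1, 2, 3, 4):
    p = prob_change_at_least(d, 4.5)
    print(f"change of {d} or more: probability {p:.2f}")
```

On these assumptions a rise or fall of three or more
between two weeks turns up in roughly two pairs of weeks
out of five, so that such a change is no evidence of a
real change in conditions.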


To arrive at results of these kinds, it is necessary to
analyse the time series so as to separate the random
fluctuations from the secular movements; and additional
complications occur if the system of random
fluctuations changes. Some would say, for example,
that in trade the random fluctuations during a slump
and a boom are different. The whole analysis is only
approximate, but it is based on ideas that are sufficiently
close to reality to give useful results.

The theory of random errors was used for measuring
the accuracy of astronomical measurements long before
it was applied to statistical samples, and it is used
somewhat in measuring the errors of experimental
observations in general. When the astronomer
measures, say, the position of a star, he finds that in
spite of the precision of his apparatus, and the care
with which he adjusts it and makes his observations,
he does not get the same answer from successive
determinations. He repudiates the idea that the
position is varying and attributes the variations in his
results to unavoidable errors of observation. The
question arises: What is the true position? And if it
cannot be measured exactly, how accurately can it be
estimated? A similar situation arises in the other
so-called exact sciences: e.g. in physics and chemistry.
Several determinations have been made of the velocity
of light, but they do not agree exactly; and a chemist
would be very surprised if he got exactly the same
result every time he measured an atomic weight.

This interpretation of experimental results as being
due to an invariable quantity plus observational or
experimental errors is purely a mental conception.
The only reality is the set of observations, the
characteristics of which can, if desired, be expressed by any
statistical constants such as the average, or a measure
of variation, or by a frequency distribution. For most
experiments, however, it is useful and (within limits)
valid to adopt the more common conception.

The errors do not follow any known exact laws, and
so the laws of chance are sometimes used to describe
them. In applying these laws, the results are regarded
as a random sample from a hypothetical population of
results, the average of this population being the true
value. Then, the average of the sample is an estimate
of the true value, and the error in that estimate can be
calculated as for any statistical sample. Is this idea
valid? On the face of things, it seems as reasonable
to imagine the millions of results that would have been
obtained had the experiment been repeated millions
of times under the same conditions as to imagine the
results of millions of tosses of a die. But it is not so
certain that the variations between experimental
results are entirely of the same kind as those we get
when we toss dice.

On this question there are differences of opinion
among experimentalists. Some refuse to admit any
similarities between experimental and random errors.
Others, faced with otherwise intractable results, use
the theory of random errors as the only way out.
Experimental errors are not, in general, random.
There are ‘personal’ factors, and any one person shows
a bias that changes from time to time. I prefer to
regard a set of experimental results as a biased sample
from a population, the extent of the bias varying from
one kind of experiment and method of observation to
another, from one experimenter to another, and, for
any one experimenter, from time to time. If this view
is accepted, experimental errors can be regarded as
forming a chance system, but the system is not as
simple as that assumed in calculating the errors of
random sampling.

In general the bias cannot be estimated and the
theory of random errors is therefore not enough.
Sometimes, however, one can say that the bias is
likely to be small compared with the random errors,
and then the theory may usefully be applied to the
results. For example, if five chemists
were to determine the atomic weight of an element
independently, in different times and places and
possibly by different methods, the results would vary
because of the effects of random errors and bias. But
the separate biases for the five chemists would differ
and so would appear as the random errors between the
results, the group as a whole would probably exhibit
but little bias, and the theory of errors would provide
a reasonably close measure of the precision with which
the average of the five results estimates the true atomic
weight. This might not be so, on the other hand, for
the average of, say, twenty consecutive determinations
made by one chemist in one laboratory.

Errors of bias are often relatively unimportant when
the observed quantity is the difference between two
similar quantities. In measuring the distance between
two lines in a spectrum, for example, the main error
is often due to the uncertainty of setting the cross-hairs
of the measuring microscope on the centres of
the lines. If there is a bias in doing this, it is likely
to be similar for the two lines (provided they are not
too dissimilar in width and appearance), and the
difference in the two settings will probably be
practically unbiased. The theory of errors gave a result
that was at least qualitatively right, when applied to
Lord Rayleigh’s measurements of the density of
nitrogen. He made a number of determinations on
‘atmospheric’ and ‘chemical’ nitrogen and found a
difference in the two averages. Subsequent treatment
by the theory of errors has shown that the difference
is greater than can be attributed to random variations,
and this result is in accordance with a real difference
we now know to exist, owing to the presence of the
rarer inert gases in ‘atmospheric’ nitrogen.


Where the bias is completely unknown, I doubt if 
it is possible to do more than hope that the true value 
lies somewhere between the highest and lowest of the 
actual values, and regard the average as an estimate of 
the true value, that is as good as, but no better than, 
any other single estimate that could be made from the 
data. It is, of course, the experimenter’s job to reduce 
bias and random errors to a minimum. 

To sum up, the theory of random errors may be
usefully applied to some experimental observations,
particularly of differences in values, but great caution
must be observed on account of bias. Certainly such
an application is no substitute for careful experimental
control.

Much experimental work, particularly in biological
subjects, is now done under conditions, many of which
can be well controlled, and the observations can be
made accurately; but the material is inherently
variable and the results have to be treated statistically.
The Lanarkshire experiment made to measure the
effect of milk on the growth of children, already
mentioned on p. 90, is of this kind. The amount of
milk fed can be controlled, children fed and not fed
with milk can be kept in the same environment, and
the changes in weight can be measured accurately;
but it would not do to base conclusions on an experiment
on, say, two children. Children vary, and it is
necessary to observe a large number and take averages.

The problem of interpreting the results of such
experiments is essentially statistical, and it has fallen
to the lot of statisticians to study the general questions
of arranging experiments with variable material, of
drawing conclusions from the results, and of testing
them. Under the leadership of Professor R. A. Fisher,
who started this work at the Rothamsted Experimental
Station (for agriculture), an elaborate technique for
doing this has been developed and is very widely used.
I propose to give some description of this subject.

There are three main principles to be observed in
designing such an experiment; they are replication,
randomization, and economy in arrangement.

The necessity for replication has already been stated.
The problem first arose chiefly in agricultural field
trials made to measure such things as the effects of
various fertilizers on wheat yield. It was early seen
that different plots treated in the same way gave
different yields. Hence, it was not sufficient to have
two plots, say, to treat one with a fertilizer, to grow the
crops and measure the yields, and to regard the
difference as measuring the effect of the fertilizer.
The experiment had to be replicated by treating
several plots in each way and measuring the difference
between the average yields.

Even differences in such averages can be affected by
variations between plots, as we can see from the
results of the sampling experiment described in the
last chapter; and it is desirable to estimate the accuracy
of the observed difference. The only known way of
doing this is by the theory of random errors. It was
found, however, that variations in plot fertility were
not random. There was usually a fertility pattern,
e.g. a gradient in fertility across the field. In order
that the theory of sampling could be applied, an
element of randomization was introduced artificially
by using some such device as a ballot to decide which
plots should receive the various experimental treatments.
This is a ‘trick of the trade’ for making
fertility variations into a comparatively simple chance
system. A statistician might apply this principle to
the above-mentioned experiment of feeding milk to
school children by tossing a coin once for each child,
giving that child milk if the result is ‘heads’, say, and
no milk if the result is ‘tails’.
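
The coin-tossing rule just described can be sketched in a few lines of Python. This is only an illustration: the function name `assign_by_coin`, the child labels, and the fixed seed are assumptions made for the example and are not part of the experiment itself.

```python
import random

def assign_by_coin(children, seed=None):
    """Toss a fair coin once per child: heads -> milk, tails -> no milk."""
    rng = random.Random(seed)  # seeded only so that the illustration is repeatable
    groups = {"milk": [], "no milk": []}
    for child in children:
        heads = rng.random() < 0.5
        groups["milk" if heads else "no milk"].append(child)
    return groups

children = [f"child {i}" for i in range(1, 21)]  # hypothetical labels
groups = assign_by_coin(children, seed=1)
print(len(groups["milk"]), "children receive milk;",
      len(groups["no milk"]), "do not")
```

With a coin the two groups will seldom be exactly equal in size; a ballot that draws a fixed number of names would keep them equal while remaining random.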

The pattern in fertility differences between plots in
a field was used to increase the accuracy of experimental
comparisons. Adjacent plots tend to be more
alike than those in different parts of the field, and by
comparing the treatments on adjacent plots the random
variations affecting the comparison were reduced, with
an increase in accuracy. The other way of increasing
accuracy is to increase the number of plots, and hence
the expense of the experiments; the arrangement
using adjacent plots is therefore more economical.
In the same way, had it been possible in the Lanarkshire
milk experiment to use identical twins, giving
milk to one of each pair, far fewer children would have
given the same accuracy as thousands chosen at
random. This kind of arrangement can be made to
satisfy the condition of randomness sufficiently for the
application of the theory of random errors in an
appropriate form.
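
The gain from comparing adjacent plots can be seen in a small simulation. The sketch below rests on assumptions of its own: a field of 20 plots whose fertility rises steadily from one end to the other, a little extra plot-to-plot variation, and a true treatment effect of 2 units. None of these figures comes from a real trial; they serve only to contrast a completely random layout with a layout in which adjacent plots are paired and a coin decides which plot of each pair is treated.

```python
import random
import statistics

def simulate(n_plots=20, effect=2.0, trials=2000, seed=0):
    """Return the spread of the estimated effect under the two layouts."""
    rng = random.Random(seed)
    random_est, paired_est = [], []
    for _ in range(trials):
        # Fertility rises steadily across the field, plus some plot-to-plot noise.
        yields = [10.0 + 1.0 * i + rng.gauss(0, 0.5) for i in range(n_plots)]

        # (a) Treatments allotted to plots completely at random.
        plots = list(range(n_plots))
        rng.shuffle(plots)
        treated, control = plots[: n_plots // 2], plots[n_plots // 2:]
        random_est.append(
            statistics.mean(yields[i] + effect for i in treated)
            - statistics.mean(yields[i] for i in control)
        )

        # (b) Adjacent plots paired; a coin decides which of each pair is treated.
        diffs = []
        for a in range(0, n_plots, 2):
            t, c = (a, a + 1) if rng.random() < 0.5 else (a + 1, a)
            diffs.append(yields[t] + effect - yields[c])
        paired_est.append(statistics.mean(diffs))
    return statistics.stdev(random_est), statistics.stdev(paired_est)

sd_random, sd_paired = simulate()
print(f"spread of estimate, random layout: {sd_random:.2f}")
print(f"spread of estimate, paired layout: {sd_paired:.2f}")
```

Under these assumptions the paired layout gives a much smaller spread for the same number of plots, which is the economy the text describes.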

The above are the elementary principles of the 
modern approach to the design of what I shall 
term statistical-experimental investigations. The whole
subject has, however, become very complicated as 
several treatments of one kind have been included, and 
treatments of several kinds. Thus, experiments may 
be done with various quantities and combinations of 
several kinds of fertilizer on several varieties of wheat. 
Further complication arises when experiments are done
on different farms and in different years, and it is
necessary to consider to what extent results obtained
on one group of farms in one year apply to other
[...]

[...] the fact that sound methods are available
[...] on non-statistical lines, and they get
[...] which they cannot fit into a system. Different
[...] sometimes get different results in the same subject,
and controversies arise. When, in [...]
the experimenters turn to sound methods of statistical
analysis, involving proper experimental [...]
difficulties of these kinds tend to disappear. Then,
experiments which were previously [...]
adequate scale are increased in size, often they are
designed more economically than before, [...]
advancement of knowledge is made more orderly and
[...]

Statistical methods are often regarded as applying
[...]
replicate some experiments hundreds and thousands
of times, and statisticians have had to make do with
small numbers. They have, however, developed the
theory of errors to apply to small samples as well as
to large ones.

There are many chance events that occur in life, to
which the general theory of random errors may in
[...]

For example, many telephone subscribers have
access to one trunk line, and a multitude of causes
determine how many will want to use it at any given
instant, i.e. it is to some extent a question of chance
whether more than one subscriber will want to use
it at once and thus cause delay. In so far as this is
true, the extent of delays of this kind can be calculated
from the theory of probability which is the basis of
the theory of errors. This is typical of a number of
congestion problems that arise in telephony, in road
and rail traffic, and so on; and although many of them
are difficult mathematically, the theory is being
applied.
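
A simple version of the trunk-line calculation may make the idea concrete. The sketch below assumes, purely for illustration, that each subscriber wants the line at a given instant with the same small probability and independently of the others; the figures of 5, 10 and 20 subscribers and the probability of 0.02 are invented. Real congestion problems call for more elaborate theory.

```python
from math import comb

def prob_more_than_one(n_subscribers, p_busy):
    """Chance that two or more of n independent subscribers want the line at once."""
    p_none = (1 - p_busy) ** n_subscribers
    p_one = comb(n_subscribers, 1) * p_busy * (1 - p_busy) ** (n_subscribers - 1)
    return 1 - p_none - p_one

# Hypothetical figures: each subscriber wants the line 2 per cent of the time.
for n in (5, 10, 20):
    print(n, "subscribers:", round(prob_more_than_one(n, 0.02), 4))
```

The chance of a clash rises quickly as more subscribers share the line, even though each one uses it rarely.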

Accidents have a large element of chance in their
causation (the circumstances preceding a ‘near shave’
often differ by only a hairbreadth from those preceding
a catastrophically fatal accident), and the theory of
probability has been useful for studying accident
problems in calculating the effects of chance and
showing the importance of other factors. The following
is an example.

Records were kept of the numbers of accidents that
happened during the course of one year to 247 men
workers engaged in moulding chocolate in a factory.
Some of the men had no accident, some had one, some
two, and so on, a few having as many as twenty-one
accidents. The data are arranged in a frequency
distribution in the first two columns of Table 15.
Now we ask: Were all the variations between the
men in the numbers of accidents they suffered due to
chance, or were there differences between the men in
their tendency to have accidents? Were the 42 men
who had no accidents exceptionally skilful or just
lucky; and were the 22 men who had ten accidents or
more clumsy or unlucky? The average number of
accidents per man is 3.94, and even if all the men
were equally skilful in avoiding accidents, chance
would give rise to some variation. It has been
calculated from the extended theory of random
sampling that this variation would result in the
frequency distribution of the last column of figures of
Table 15. This is very different from the actual
distribution. We may say, roughly, that 5 of the 42
men with no accidents were lucky and the remaining
37 skilful; that one of the 22 men with ten or more
accidents was unlucky and the remainder clumsy.
Comparisons of this kind between actual and calculated
chance distributions have led to investigations
that have shown how people differ in ‘accident
proneness’, i.e. in their tendency under given circumstances
to suffer accidents. The chance distribution
given in Table 15 is calculated by assuming a very
simple system of chance variations; more complicated
systems taking into account variations in accident
proneness have been used in the more advanced
investigations on the subject.

Table 15

Frequency Distribution of Men who had Various Numbers
of Accidents. Comparison between Actual and Chance
Distributions

(Data by E. M. Newbold, Report No. 34, Industrial
Fatigue (Now Health) Research Board)
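
The text does not name the ‘very simple system of chance variations’ behind the chance distribution. One common choice for such a calculation is the Poisson law, and the sketch below assumes it; that assumption is mine, not the source's. Only the figures of 247 men and an average of 3.94 accidents are taken from the text.

```python
from math import exp, factorial

def poisson_expected(n_men, mean, k):
    """Expected number of men with exactly k accidents under a Poisson law."""
    return n_men * exp(-mean) * mean ** k / factorial(k)

N_MEN, MEAN = 247, 3.94  # figures quoted in the text

expected_zero = poisson_expected(N_MEN, MEAN, 0)
expected_ten_plus = N_MEN - sum(poisson_expected(N_MEN, MEAN, k) for k in range(10))

print(f"expected men with no accidents: {expected_zero:.1f}")  # about 4.8
print(f"expected men with ten or more:  {expected_ten_plus:.1f}")
```

On this assumption chance alone would leave about five men free of accidents, in line with the ‘5 of the 42’ mentioned above. The figure for ten or more accidents need not agree exactly with a table whose entries are rounded to whole men.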


CHAPTER VIII

STATISTICAL LAWS 

The central problem of statistics is dealing with groups
variously described as collections, crowds, aggregates,
masses, or populations, rather than with individual or
discrete entities; with events that happen on the
average or in the long run rather than with those that
happen on particular occasions; with the general
rather than with the particular. A fuller consideration
of this aspect of statistics is the subject of the
present chapter.

Again I shall use the language common in statistical
writings and refer to populations of individuals. The
population is regarded in Chapter VI as something
from which samples are taken, but here as an aggregate
of individuals, which will in most instances be represented
by a sample, i.e. I shall not distinguish between
the population and the sample.

The population has characteristics and properties of
its own, which are essentially derived from and are an
aggregate of those of the individuals, although the two
sets of properties may be different in kind. In the
population, the individuals merge and their individuality
is dissolved, but from the dissolution rises a new
entity like a phoenix from the flames. The population
is at the same time less and more than the totality of
the individuals.

This conception is not peculiar to statistics.
Rousseau, for example, distinguishes in The Social
Contract between the General Will and the wills of all
the people:

‘In fact, each individual, as a man, may have a
particular will contrary or dissimilar to the general
will which he has as a citizen. His particular interest
may speak to him quite differently from the common
interest [...]

‘There is often a great deal of difference between
the will of all and the general will; the latter considers
only the common interest, while the former takes
private interest into account, and is no more than a
sum of particular wills: but take away from these
same wills the pluses and minuses that cancel one
another, and the general will remains as the sum of
the differences.’


The general idea is expressed in another way in
the following passage from Old Junk by Mr. H. M.
Tomlinson:

‘His shop had its native smell. It was of coffee,
spices, rock-oil, cheese, bundles of wood, biscuits and
jute bags, and yet was none of these things, for their
separate essences were so blended by old association
that they made one indivisible smell, peculiar, but not
unpleasant, when you were used to it.’

The loss of individuality results from the method
of the statistician in confining his attention to only a
few characteristics of the individuals and grouping
them into classes. Consider a married couple, say
Mr. and Mrs. Tom Jones. As a couple their individuality
consists in a unique combination of a
multitude of characteristics. Mr. Jones is tall and
thin, is aged 52 years, has brown hair turning grey, and
is a farmer. Mrs. Jones is called Mary and at 38
years is still handsome; she is blonde and is really a
little too ‘flighty’ for a farmer’s wife. The couple
have been married for 16 years and have three
children: two boys aged 14½ and 11 years, and a girl
aged 2. In addition to these and similar attributes
the couple have a number of moral and spiritual
qualities that we may or may not be able to put down
on paper. It is by all these, and a host of other
qualities that their relatives and neighbours know
Mr. and Mrs. Jones; the uniqueness of the combination
of qualities is the individuality of the couple.

The statistician who is investigating, say, the ages
of husbands and wives in England and Wales is
interested only in the ages, and does not wish to
describe even these accurately. So he puts our
couple in that class (Table 8, p. 51) for which the age
of the husband is 45-55 years and that of the wife is
35-45 years. Mr. and Mrs. Jones are now merely
one of a group of some 320,000 other couples, and are
indistinguishable from the others in their group.

Statistical investigations are not always confined to
one or two characters of the individuals, and elaborate
methods have been developed for dealing with many
attributes, e.g. the ages of married couples at marriage,
income, number of children, fertility of the grandparents
of the children, and so on, but however many

[...] population of individuals is the most characteristic [...]
concerned with [...] in classes. However [...]
characters show the variation between [...]
much we analyse the data to [...]
the parts, we still [...] individuals. In [...]
averages; we never [...] example, we [...]
studying the [...] average into sub-averages [...]
may decompose the general average [...]
for the two sexes; but [...]
different localities [...] meaning. When [...]
of a mass of variable
individuals rather than of one or two being very
different from the [...] that this part
of [...] technique in [...]
the development [...] particular, if [...]
much we know of Mr. Jones [...] drawing
[...] married couples in general. It is
only by paying attention to such features as individuals
have in common with others that we can generalize.
Individuals are important, as such, to themselves, to
their neighbours and relations, and to professional
consultants: the parson, the doctor, and the [...]
they have no importance for the statistician, nor indeed
for any scientist, except that they, with a host of other
individuals, provide data.

Our first and, for most of us, our only reactions to
our environment are individualistic. We are individuals,
our experience is mostly with individuals, and
even when considering a group we are conscious
mostly of our personal relationship to it. The
concept of the population as an entity does not come
easily, and our ordinary education does little to correct
this defect. The mental effort required to realize this
concept is perhaps something like that necessary to
appreciate a fugue with its contrapuntal pattern, as
compared with the ease of following a tune with simple
harmonies.

The characteristics of the population are described
by frequencies and by the statistical constants and
averages already described, but it is apparently so
difficult to think of the reality behind these constants
(the mass of individuals) that we personify the population
and speak in such terms as ‘the average man’.
This is only possible because of a similarity between
some of the measures of a population and those of an
individual; the average height of a group of men is
expressed in feet and inches, just as the height of one
man is; but the similarity is only superficial.

We have already seen the inadequacy of the average
as a description of variable material (Chapter V), but
the average individual sometimes is also a rather
absurd figure. In 1938, for example, he was among
those comparatively rare individuals who died at the
age of 58 years. His age in England and Wales in
1921 was 29.9, and in 1938 it was 33.6 years; i.e. in
17 years the average man aged by only 3.7 years. The
average family can have fractions of a person. Books
on the upbringing of babies usually contain a curve
showing the growth in weight of an average baby,
but few actual curves are like that. The curve for a
real baby may be above or below that for the average
and it may have a different slope in various parts; it
will also usually have ‘kinks’ due to teething troubles
and minor illnesses, whereas the curve for the average
baby is fairly smooth; this paragon among children
[...]

Variation is, of course, [...]
of populations that individuals cannot [...]
already been at pains to describe this in Chapter V
and to point out how, for example, the deviations from
any relationship shown by a contingency
table are as characteristic of the data as the relationship
itself. Indeed, without variation, a collection of
individuals is scarcely a population in the statistical
sense. A thousand exactly similar steel bearing balls
(if such were possible) would be no more than one
ball multiplied one thousand times. It is the quality
of variation that makes it difficult at first to carry in
mind a population in its complexity.

All the special properties of populations I have
considered arise in aggregates of independent individuals,
but there are additional characteristics due to
interactions between individuals. The behaviour of
men in the mass is often different from their behaviour
as individuals. Some men affect (or ‘infect’) others,
and such phenomena as mass enthusiasms and panics
arise. We speak of mass-psychology. Similarly the
effect of an infectious disease on a community of
people in close contact is different from its effect on
a number of more or less isolated individuals. Statistical
description can take account of interactions
between individuals, but it is seldom necessary to do so.

Although the individuals in a population vary, the 
characteristics of the population itself are very stable. 
Sir Arthur Eddington has well said: ‘Human life is
proverbially uncertain; few things are more certain
than the solvency of a life-insurance company.’ This
means that we do not know when any individual will 
die, but an insurance company can estimate the 
incidence of death in its population of policy-holders 
with great accuracy. 
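
The stability of the whole can be put in figures under a simple assumption. If each of n policy-holders dies within the year with the same probability q, independently of the rest, the observed death rate varies about q with a standard deviation of sqrt(q(1 - q)/n). The value q = 0.01 below is hypothetical and chosen only for illustration; it is not taken from any life table.

```python
from math import sqrt

def death_rate_spread(n_policy_holders, q):
    """Standard deviation of the observed death rate when each of n
    independent policy-holders dies within the year with probability q."""
    return sqrt(q * (1 - q) / n_policy_holders)

q = 0.01  # hypothetical yearly chance of death
for n in (1, 100, 10_000, 1_000_000):
    print(f"{n:>9} policy-holders: rate {q:.3f} +/- {death_rate_spread(n, q):.5f}")
```

For one person the uncertainty is many times the rate itself; for a million it is a small fraction of it, which is the contrast Eddington's remark points to.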

This contrast between individualistic variability and
statistical stability, and the fact that the latter emerges
from the former, this apparent paradox of order coming
out of chaos, has from time to time given rise to metaphysical
speculations. People in the eighteenth
century, accustomed to considering the variations
between individuals, seem to have been struck by the
statistical regularities and saw evidences of a Divine
order. Sir Arthur Eddington, on the other hand,
presumably taking for granted the regularity of the
laws of physics, is more struck by the compatibility
with these laws of the unpredictable variation in the
behaviour of individual electrons, and offers comfort
to those who want to believe in free will and scientific
law at the same time. The practical statistician may
accept it as a fact requiring no special metaphysical
explanation, that mass regularities can often be
discerned where the individuals apparently follow no
regular laws.

Galton writes of the regularity of form of the
frequency distribution in the following terms:

‘I know of scarcely anything so apt to impress the
imagination as the wonderful form of cosmic order
expressed by the “Law of Frequency of Error”. The
law would have been personified by the Greeks and
deified, if they had known of it. It reigns with
serenity and in complete self-effacement amidst the
wildest confusion. The huger the mob, and the
greater the apparent anarchy, the more perfect is its
sway. It is the supreme law of Unreason. Whenever
a large sample of chaotic elements are taken in hand
and marshalled in the order of their magnitude, an
unsuspected and most beautiful form of regularity
proves to have been latent all along.’

Let us re-examine the data from the sampling 
experiment described in Chapter VI and see if we can 
repeat Galton’s experience and recapture something 
of his mood. 

I have extended the experiment to obtain 4,000
scores altogether. The first thirty are given in the
top part of Table 14 (p. 82) in the order in which they
occurred, and these together with the 3,970 other
scores are the ‘large sample of chaotic elements’, and
chaotic they undoubtedly appear. I then proceeded
to marshal the scores in the order of their magnitude
by forming a frequency distribution, and stage by
stage stopped to look at the result as the distribution
began to grow. The results for 50, 200, 1,000, and
4,000 scores are in Figure 17. Since the scores are
whole numbers, I have not grouped them into sub-ranges;
the scales of the distributions in the vertical
direction have been reduced as the numbers of scores
have increased. At 50 scores, there is no sign of any
regularity or form in the distribution, but at 200
scores, a vague suggestion of a form seems to be
emerging; the scores show a slight tendency to pile
up in the middle of the range. At 1,000 scores, the
form is clearly apparent, although irregularities are
still pronounced; but at 4,000 scores, the ‘most
beautiful form of regularity’ is there, almost in perfection.
It is not difficult to imagine the regularity
that would be apparent were the sample so large as
to be indistinguishable from the population.
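
Readers may like to repeat the exercise for themselves. The sketch below does not reproduce the Chapter VI experiment, whose details are not restated here; as a hypothetical stand-in, each score is the total shown by four dice. The function name `draw_scores` and the seed are likewise assumptions. The frequency distribution is printed at the same four stages of 50, 200, 1,000, and 4,000 scores.

```python
import random
from collections import Counter

def draw_scores(n, seed=0):
    """Hypothetical stand-in for the sampling experiment: each score is the
    total shown by four dice, so scores are whole numbers from 4 to 24."""
    rng = random.Random(seed)
    return [sum(rng.randint(1, 6) for _ in range(4)) for _ in range(n)]

scores = draw_scores(4000)
for stage in (50, 200, 1000, 4000):
    freq = Counter(scores[:stage])
    peak = max(freq.values())
    print(f"\nFirst {stage} scores")
    for value in range(4, 25):
        # Rescale every histogram to the same width, much as the vertical
        # scales are reduced for the larger samples.
        bar = "#" * round(40 * freq[value] / peak)
        print(f"{value:>2} {bar}")
```

The printed bars are a crude substitute for a drawn figure, but they allow the growth of form in the distribution to be watched stage by stage.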
The formulae and laws that describe populations
and their behaviour as opposed to individuals are
termed statistical laws. The various statistical constants
mentioned in Chapter V are elementary
statistical laws. Other laws of a higher order of
complexity describe how populations change with
time or place, or other circumstances. Laws of
heredity, for example, are a way of describing how
some characters in populations of plants or animals
change from generation to generation.

Some statistical laws are discovered by simple
observation of the population as a [...]
example, the change in the death-rate for the country
may be recorded from year to year, [...]
consideration being given to the changes in the chances
of death from various causes, to which [...]
is exposed. A public lighting [...]
two batches of electric lamps by [...]
of each are burnt out after having been in use for say
500 hours. Or a colony of the banana fly may be
kept in a bottle under standard conditions, and the
growth in numbers observed. However, there is
nothing necessarily statistical in the technique applied
in such experiments, although [...]
character are often classed as statistical in the widest
sense of the word. The introduction of the concept
of pieces of matter as populations of electrons [...]
does not necessarily turn an ordinary physical
investigation of the macroscopic properties of matter in
[...]
Statistical methods and calculations are [...],
however, when the laws for the population are deduced
from those for individuals. The calculation of statistical
constants is a case in point, and the estimation
of some quality of a batch of electric lamps from
calculations made on the full frequency distribution of
lives is another. Estimates, made by demographers,
of the size and age composition of the future population
from a consideration of the characteristics of the
present population and the various birth and death
rates, are an important example of the statistical
deduction of statistical laws. Such calculations may
involve complicated mathematics.

It is implicit in all I have written that statistical laws 
have nothing to do with individuals. It is no excep- 
tion to the statistical law that old men have old wives, 
on the average, if one old man of one’s acquaintance
has a young wife. A failure to recognize the distinc- 
tion between the two types of laws sometimes leads to 
attempts to apply statistical laws to individuals, with 
paradoxical results. 

We now return to the starting-point of this chapter:
a consideration of individuals. They in the aggregate
are the population, and from their characteristics we
can calculate those of the population. We cannot
perform the reverse process. Individuality is lost, as
far as the statistician is concerned, for good and all.
Does this mean we know absolutely nothing of the
individual when we know the population? Not quite.

Consider a single electric lamp taken at random from
the batch represented by the distribution of Table 5
(p. 41). Even if we do not know its life, we know
that it will be an exceptional lamp if its life is greater
than, say, 2,800 hours; it will be one of 4/150ths of
the batch. Indeed, it is more likely to be one of the
89/150ths of the lamps with lives between, say, 1,000
and 2,000 hours.

We are used, in ordinary life, to dealing with data
of this kind by introducing the concept of probability.
In the example quoted we would say that the
probability of any one lamp having a life greater than
2,800 hours is 4/150 = 0.027, and that the probability
of the life being between 1,000 and 2,000 hours is
89/150 = 0.593.

This is an application of what is commonly regarded
as the statistician’s definition of probability as a ratio
of frequencies. Corresponding to any frequency
distribution there can be calculated a whole series of
probabilities of a random individual lying within
various stated limits, and statistical probability is a
device (a verbal trick!) for attaching to the random
individual the characteristics of the whole distribution.
In this way, a population is epitomized in an individual
much more satisfactorily than in the concept of the
‘average man’. But statistical probability does more
than this. It corresponds closely to the more popular
idea of probability as a measure of the strength of
belief in a thing. Most people if asked what is the
probability of a tossed penny falling heads uppermost
would reflect that heads was as likely as tails and would
reply: one-half. The statistician, if in a pedantic
mood, would reply: in the hypothetical population of
tosses, one-half of the total give heads, therefore the
probability of a head is one-half. An alternative
method of expression is to state that the chances of a
head are even, or for the lamps, that they are 593 to 407
in favour of a life of between 1,000 and 2,000 hours.
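
The arithmetic of the lamp example is easily checked. In the sketch below the counts 4, 89 and 150 are those quoted in the text; the function names `probability` and `odds_per_thousand` are mine and are used only for illustration.

```python
from fractions import Fraction

BATCH = 150  # lamps in the batch of Table 5

def probability(count, total=BATCH):
    """Statistical probability as a ratio of frequencies."""
    return Fraction(count, total)

def odds_per_thousand(p):
    """Express a probability as 'a to b' chances out of 1,000."""
    in_favour = round(p * 1000)
    return in_favour, 1000 - in_favour

p_long = probability(4)   # life greater than 2,800 hours
p_mid = probability(89)   # life between 1,000 and 2,000 hours

print(round(float(p_long), 3), round(float(p_mid), 3))  # 0.027 0.593
print(odds_per_thousand(p_mid))                          # (593, 407)
```

Keeping the ratios as exact fractions until the last step avoids any rounding before the probabilities are turned into chances.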

Probability is, in ordinary life, also applied to events 
that do not occur as frequencies. We speak of the 
probability of or the chances in favour of a particular 
horse winning a race. Even in such instances,
however, I think that people carry at the backs of their
minds the idea of frequencies ; they in effect imagine 
a lot of races, in a given proportion of which the 
particular horse wins. The idea is described in the 
following quotation from a lecture given by Karl 
Pearson in 1892: 

‘A friend is leaving us, say in Chancery Lane at
4 o’clock in the afternoon, and we tell him that he
will find a Hansom cab at the Fleet Street corner.
There is no hesitation in our assertion. We speak
with knowledge, because an invariable experience has
shown us Hansom cabs at 4 o’clock in Fleet Street. But
given the like conditions within reach of a suburban
cab-stand, and our statement becomes less definite.
We hesitate to say absolutely that there will be a cab:
“You are sure to find a cab”, “I believe there will be
a cab on the stand”, “There is likely to be a cab on
the stand”, “There will possibly be a cab on the
stand”, “There might perhaps be a cab”, “I don’t
expect there will be a cab”, “It’s very improbable”,
“You are sure not to find a cab”, etc., etc. In each
and every case we go through some rough kind of
statistics, once we remember to have seen the stand
without a cab; on occasions few and far between,
“perhaps on an average once a month”, “perhaps
once a week”, “every other day”, “more often than
not there has been no cab there”. Certainty in the
case of Fleet Street passes through every phase of
belief to disbelief in the case of the suburban cab-stand.
If once a month is the very maximum of
times I have seen an empty cab-stand, my belief that
my friend will find a cab there to-day is far stronger
than if I have seen it vacant once a week. A measure
of my belief in the occurrence of some event in the
future is thus based upon my statistical experience of
its occurrence or failure in the past.’

Thus probability in its most general use is a measure
of our degree of confidence that a thing will happen.
If the probability is 1.0, we know the thing will
certainly happen, and if the probability is high, say
0.9, we feel that the event is likely to happen. A
probability of 0.5 denotes that the event is as likely to
happen as not, and one of zero means that it certainly
will not. This interpretation, applied to statistical
probabilities calculated from frequencies, is the only
way of expressing what we know of the individual
from our knowledge of the population.

Statistical laws, which describe the characters and 
behaviour of populations in one way or another, may 
be transformed into probabilities, i.e. from them the
probabilities and frequencies in the population may 
be calculated. Thus, statistical laws are the chance 
laws referred to in the early part of Chapter VII. 

It may have been noticed that probabilities have 
been calculated from the frequencies of a distribution, 
either known as for the lamps, or assumed as for the 
penny. In general, it is necessary to have some data 
on which to calculate probabilities. I am often asked
what is the probability of some queer or interesting 
event, without being given any data. Statisticians do 
not evolve probabilities out of their inner consciousness, 
they merely calculate them. 



CHAPTER IX 

STATISTICAL REASONING 

Statistical facts may have interest purely as a 
description of something that has happened or of an 
existing state of things, and certainly have great value 
in practical affairs. But we are seldom content with 
this use ; we try to interpret the facts and expect 
them to tell us something of the underlying processes 
at work in the world we are studying. It is in this 
connexion that statistics is mostly misused. In this 
chapter I propose first to illustrate some of the mis- 
uses, and then to describe the ideas behind the ways 
in which statisticians try to learn from statistics. 
Statistical reasoning is not really different from any 
other kind of reasoning, and since the statistical 
method is a special case of the general scientific 
method I shall devote some attention to the latter. 

I think there are two reasons why statistics are so
much misused. First, the desire to interpret them is
so strong within us that it is almost an instinct, and
we are apt to embark on interpretations too easily,
without adequate mental preparation, and even
without training in scientific habits of thought.
Consequently we tend to jump to the superficially
obvious conclusions, which all too often are not the
correct ones. The interpretation of statistics is a
matter for the expert, although one may be an expert
without necessarily having a university degree in the
subject. Second, statistics are so often made to serve
the purposes of a propagandist: the man who does
not use figures to arrive at the truth of a matter, but
thinks he knows the truth and only wishes to convince
other people. The propagandist may misuse statistics
honestly, from ignorance, or dishonestly and deliberately;
or he may countenance that more subtle form
of dishonesty of presenting data in a way that is
formally correct, but misleading, leaving it to the
public to draw the obvious but false conclusion.
This last trick is most insidious when it is accompanied
by some remark intended to disarm criticism, such as:
‘I know these figures may be interpreted in several
ways, but . . .’

One favourite trick of the propagandist is to use
some impressive but irrelevant figures to give a
spurious appearance of precision under the cover of
which a dubious argument is ‘slipped across’ to the
public. Commercial advertisements often do this.
‘Five thousand and sixty-seven typists were asked
what they prefer in shoes and four thousand nine
hundred and ninety-five, or 98.6 per cent., prefer
comfort to smartness; therefore buy XYZ shoes.’
That is the kind of argument one meets.

A common source of error is the use of inaccurate 
data, of misleading methods of presentation, or of 
data that are so incomplete as to be misleading. 1 
have shown in Chapters II to VI many ways in which 
this may arise. Here is an example about a subject 
that has been of a very lively concern to the British 
public. On 13th May 1941 most newspapers reported 
the following facts of the numbers of German night 
bombers brought down in raids over Britain . 

‘The total for the eleven nights of May now stands 
at 133; the previous highest figure was 90 for the 
whole of April.’ 

That suggests an enormous improvement in the 
effectiveness of British night defences. Indeed, the 
quantitative impression is of an increase in the number 
of bombers brought down per night from 90/30 = 3.0 
to 133/11 = 12.1. This comparison should not be so 
interpreted without a knowledge of the numbers of 
bombers used. Those figures were not given, but we 
noticed at the time that most of the raiding in April 
was confined to the ten or eleven nights before that 
of the full moon. Full moon was on April 11/12th 
and May 10/11th. Up to and including the night of 
April 11th, 51 night bombers were brought down, 
giving a rate of 4.6 per night. This is less than the 
rate for the first eleven nights of May, but not so 
much less as the figures originally quoted suggest. 
We cannot be sure that the number of nights before 
the full moon is the significant basis of comparison, 
but it is almost certain that the crude comparison gave 
an impression which, to the Britisher, was unduly 
optimistic. 
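
The arithmetic behind this comparison can be checked in a few lines. The sketch below is only an illustration using the figures quoted above; the function and variable names are my own, and the choice of the eleven nights before full moon as the basis is, as the text says, an assumption rather than an established fact.

```python
# Nightly rates of bombers brought down, using the figures quoted in the text.
# Names are illustrative; the "nights before full moon" basis is an assumption.

def nightly_rate(bombers_down: int, nights: int) -> float:
    """Average number of bombers brought down per night."""
    return bombers_down / nights

april_whole = nightly_rate(90, 30)      # whole of April (30 nights)
may_first_11 = nightly_rate(133, 11)    # first eleven nights of May
april_pre_moon = nightly_rate(51, 11)   # April nights up to and including the 11th

# Crude comparison: May rate against the whole-April rate.
crude_ratio = may_first_11 / april_whole
# Like-for-like comparison: May rate against the April nights before full moon.
like_for_like_ratio = may_first_11 / april_pre_moon

print(f"April, whole month:      {april_whole:.1f} per night")
print(f"April, first 11 nights:  {april_pre_moon:.1f} per night")
print(f"May, first 11 nights:    {may_first_11:.1f} per night")
print(f"Crude improvement:       {crude_ratio:.1f} times")
print(f"Like-for-like:           {like_for_like_ratio:.1f} times")
```

On the crude basis the rate appears to have multiplied about fourfold; on the like-for-like basis the increase is nearer two and a half times, which is still an improvement but a much less dramatic one.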

The pattern of cause and effect in the world which 
produces statistical data is very complicated, and to 
any set of figures several plausible interpretations are 
usually possible. It is a common error to consider or 
give only one interpretation, to the exclusion of others 
that may be equally reasonable but perhaps less agree- 
able to the propagandist. The following advertisement 
appeared in 1931 : 

‘It is men of exceptional experience who are buying 
X . . . cars to-day. 

‘87 per cent of X . . . cars to-day are bought by 
men who have owned six other makes of cars before.’ 

I suppose it is unlikely that as many as 87 per cent of 
all makes of cars are bought by such veterans as those 
mentioned in the advertisement, and the purchasers 
records? Moreover, the poorness of the data is 
covered over, doubtless unintentionally, by some very 
exact but irrelevant figures. It does not matter ‘two 
hoots’ how many children were questioned, or how 
many took porridge. Without these figures, the data 
would be seen to be what they are: weak. However, 
even if the facts are taken at their face value, the letter- 
writer errs in considering only one of the factors that 
could have contributed to the alleged results. Most 
people take milk with porridge, which might be extra 
to milk taken otherwise, and that might be the cause 
of the improved health. All we know, if we know 
anything from the data, is that oatmeal plus milk plus 
the condiments are good for health, as compared with 
the food that is eaten as an alternative. 

Wrong conclusions are sometimes drawn from data 
of quantities that change in time, and it is a standard 
part of the statistician’s functions to recognize 
‘nonsense correlations’. For instance, Mr. Udny Yule 
refers to the fact that the proportion of marriages 
solemnized in the Church of England and the death- 
rate for the country have for many years been 
decreasing: there is a correlation between the two 
quantities. I doubt, however, if anyone supposes that 
this fact implies a causal relationship, and that a law 
prohibiting the solemnization of marriages in Anglican 
churches would reduce further the mortality rate of 
the nation. 
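
How two unrelated quantities come to be correlated merely because both decline with time can be shown with made-up figures. The sketch below is an illustration only: the two series are synthetic (they are not Mr. Yule's data), the names are hypothetical, and the device of correlating year-to-year changes instead of levels is one common check that I bring in here as an assumption, not a method described in the text.

```python
# Two synthetic series that share nothing but a downward trend in time.
# All numbers are invented for illustration; they are not real records.
from math import sqrt

def pearson(xs, ys):
    """Ordinary product-moment correlation coefficient."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

def changes(xs):
    """Year-to-year changes, which remove a steady trend."""
    return [later - earlier for earlier, later in zip(xs, xs[1:])]

years = range(21)
# Each series = straight-line decline + its own unrelated wobble.
series_a = [100 - t + (1 if t % 2 == 0 else -1) for t in years]
series_b = [50 - 0.5 * t + (1 if t % 4 < 2 else -1) for t in years]

r_levels = pearson(series_a, series_b)
r_changes = pearson(changes(series_a), changes(series_b))

print(f"correlation of levels:  {r_levels:.2f}")
print(f"correlation of changes: {r_changes:.2f}")
```

The levels are strongly correlated because both series fall with time; once the common trend is removed by taking changes, nothing remains. A high correlation between two quantities that both drift in time is therefore no evidence, by itself, that one acts on the other.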

A neglect to consider the errors of sampling or the 
effects of random fluctuations sometimes leads to false 
interpretations of statistical data. These effects have 
been dealt with in Chapters VI and VII. 

So much for the things that should not be done in 
handling statistical data. Let us now consider the 
more positive aspect of what should be done. To 
provide a special case, on which the discussion will be 
centred, I will remind the reader of Lamb’s Dissertation 
upon Roast Pig. According to Lamb’s imaginary 
Chinese manuscript the art of roasting, or rather 
broiling, was accidentally discovered in the following 
manner. A swineherd, Ho-ti, left his cottage in 
the care of his eldest son Bo-bo, who, being fond 
of playing with fire, let some sparks escape into a 
bundle of straw, which reduced the cottage to ashes. 
‘Together with the cottage . . . what was of much 
more importance, a fine litter of new-farrowed pigs, 
no less than nine in number, perished. Bo-bo was 
in utmost consternation. . . . While he was thinking 
what he should say to his father an odour assailed his 
nostrils. A premonitory moistening at the same time 
overflowed his nether lip. He stooped down to feel 
if there were any signs of life in the pig. He burnt 
his fingers, and to cool them he applied them in his 
booby fashion to his mouth. Some of the crumbs of 
the scorched skin had come away with his fingers and 
for the first time in his life (in the world’s life indeed, 
for before him no man had known it) he tasted 
crackling! . . . The truth at length broke into his slow 
understanding, that it was the pig that smelt so, and 
the pig that tasted so delicious.’ 

When Ho-ti returned there were at first the mis- 
understandings Bo-bo expected and feared, but 
gradually the great truth was borne in upon Ho-ti’s 
mind and ultimately ‘both father and son fairly sat 
down to the mess, and never left off till they had 
dispatched all that remained of the litter’. 

Ho-ti of course wanted more roast pig, and so, ‘as 
often as the sow farrowed, so sure was the house of 
Ho-ti to be in a blaze’; and later, after the secret had 
been dragged into the light in a law court, ‘there was 
nothing to be seen but fires in every direction’. 

This is an example of the empirical method. Certain 
results are observed to follow from a certain set of 
circumstances, so in order to repeat the result the 
circumstances are repeated. This method is often 
regarded disparagingly, but it is widely and successfully 
used. If we know that certain desired ends can be 
achieved by certain means, we rightly use those means, 
without waiting until we can find out how they work 
and if all the means are necessary to achieve the ends. 
Our forefathers would have been foolish had they 
waited for the discovery of vitamin C before making 
use of the knowledge that fresh vegetables in the diet 
prevent scurvy. Indeed, medicine is a fine example 
of the successful use of empirical knowledge (I do 
not imply that medicine does not also use scientific 
knowledge). 

The empirical method, however, does not take us 
very far, and often leads us astray. Experience leads 
us to believe that always, if we can re-establish exactly 
all the circumstances that gave rise to a result, that 
result will be repeated exactly; but we can never be 
sure of re-establishing all the circumstances. More- 
over, not all of them are essential, and without some 
analysis of the causes that operate we may repeat certain 
non-essential circumstances and omit essential ones. 
For example, we are not told which way the wind was 
blowing when Ho-ti’s house first burnt; as it happened 
that did not matter, but if it had, and Ho-ti had not 
taken it into account, the empirical method might 
have failed him. He was lucky to have included the 
important factor in his subsequent trials. 
The empirical method is not only uncertain; it is 
often wasteful. Not only was it wasteful for the 
compatriots of Ho-ti to burn their houses in order to 
roast pigs, but the failure to discover the general science 
of the use of heat for cooking may have involved them 
in burning their boats to cook fish and might even have 
led to disastrous experiments into the burning of crops 
to improve their taste. It is desirable to discover the 
causes of an observed phenomenon so that the essential 
factors can be reproduced in the most advantageous 
way and applied generally. 

Finally, it is only when our knowledge of the causes 
is fairly detailed that we can continue with investiga- 
tions to improve the result. Until they had discovered 
that it was the application of heat that cooked the pig, 
it would scarcely be feasible for the people of Ho-ti’s 
time to experiment with the effects of different degrees 
of heat and discover the uses of boiling, frying, and so 
on. 

All the foregoing are utilitarian reasons why we are 
not content with the empirical method, but transcend- 
ing them all is the intellectual desire we have to 
‘explain’ things: to describe the relations between 
different happenings : to reduce our knowledge of the 
universe to as few general principles as possible. 

It is the main function of science to analyse the 
causes of events and build up a system of general laws, 
and so we regard scientific knowledge as the opposite 
of empirical knowledge. This is the sense in which I 
use the word scientific in this book. Science and 
empiricism, however, differ only in degree, and 
scientists of the present generation, at least, are very 
modest in not claiming any sort of finality for the laws 
they formulate. Ho-ti was working on the very lowest 
level of empiricism when he burnt his house to roast 
pig and he would have been more scientific had he 
recognized that it was only necessary to have some sort 
of fire. We know now that even fire is not necessary, 
and that heat of a sufficient degree produced in any way, 
e.g. electrically, will roast the pig; but it is conceivable 
that some scientist of the future will discover the 
essential chemical and physical changes that occur 
when pork is roasted and will give us other ways of 
achieving the result. What the next stage after that 
can be I cannot imagine, but it would be very rash to 
say there will be no next stage. The point is that none 
of these stages is purely empirical and none is purely 
scientific; they differ only in degree. 

The scientific method is so efficient, and has been so 
successful in giving man so much power over his 
environment — power to construct as well as to destroy 
— that we are apt to overlook its limitations. A 
scientific description is by its nature a simplified de- 
scription of a phenomenon and its relation to other 
parts of the universe, and is far from complete. This 
has been shown strikingly in connexion with the 
science of dietetics. Earlier in this century, food 
values were largely measured in terms of calories and 
the chemical constituents: fats, carbohydrates, proteins, 
and so on. Then it was found that these things 
were not enough, and that a diet was deficient unless 
it contained a due quantity of the various vitamins. It 
is because they have no faith in the completeness of 
the existing scientific description of dietetics that most 
people prefer to rely mostly on the empirical method 
in arranging their diet, choosing to eat such natural 
foods as general experience has shown to be good, and 
the irrelevance of everything but fire and pig, although 
he might be a bit puzzled because all fires did not 
produce the desired effect (if they were not hot enough 
or did not last long enough, or were too hot or lasted 
too long). This last observation would probably lead 
him to try cooking the pigs on fires of various sizes and 
durations — and so the process would go on. 

This is the experimental method of investigation: 
the method of producing the circumstances surround- 
ing a phenomenon in various ways and combinations, 
but always under experimental control. The art of 
this method lies in the proper choice of combinations 
of circumstances and in the craftsmanship of exercising 
the required control; and if these have been well done 
there is little difficulty in correctly appraising the 
results, although great acumen may be necessary to 
weld them into a coherent scientific theory. 

In most fields that are the subject of statistical 
inquiry, the opportunities for controlled experiment 
are very few. Society will not permit many experi- 
ments on man. So the statistician, debarred from 
varying the circumstances surrounding the subject of 
his investigation, has to observe the results of such 
variations as occur without his intervention and learn 
from them, disentangling as much as he can from the 
‘tangled skein’ of causes and effects. Thus, if the 
Chinese authorities had prohibited experiments by 
the physicist into the production of roast pig, the 
statistician would be called upon to observe closely the 
results of the fires that appeared ‘in every direction’, 
in the hope that the circumstances surrounding them 
would be varied enough to enable him to decide which 
were essential and which were not. 

This limitation does not preclude the application of 
number of country areas. The difference between the 
two averages would then be regarded as measuring the 
required effects. 

The use of correlation methods is an extension of 
this idea. As I have already stated, the trend shown 
in Figure 15A (p. 53) discovers the average effect on 
the wheat crop of changes in the area cultivated, and 
the diagram is arranged to emphasize this effect. A 
correlation may result directly from the operation of 
a cause, but I have already emphasized that its exist- 
ence does not prove a causal relationship. A careful 
analysis is necessary before such an interpretation of 
a correlation is legitimate, and usually there must be 
additional grounds to support it. In this analysis, 
statistical methods play a part, particularly the method 
known as partial correlation. The following is an 
example. 

In an investigation by Mr. D. Glass into factors 
associated with changes in birth-rates, the following 
data for 1930-32 for the separate counties of England 
and Wales were used: (1) the gross reproduction rate, 
which is similar to the net reproduction rate (p. 61) 
except that no account is taken of the incidence of 
death; (2) the percentage of females over 15 years 
of age who were unmarried, which I shall call the 
percentage spinsterhood; and (3) the percentage of 
females over 15 years of age who were in employment, 
which I shall shortly describe as the employment rate. 
These three quantities were taken in pairs and corre- 
lated (see Chapter IV and p. 76), with the following 
results: (a) there was a correlation coefficient of 
-0.433 between factors (1) and (2), expressing a fairly 
weak but definite tendency for the reproduction rate 
to decrease as the percentage spinsterhood increases, 
a result not unexpected; (b) the correlation coefficient 
between factors (1) and (3) was -0.625, expressing a 
stronger tendency for the reproduction rate to decrease 
as the employment rate among women increases; (c) 
there was a correlation coefficient of +0.530 between 
factors (2) and (3); i.e. the greater the percentage 
spinsterhood the greater is the employment rate among 
women. On the face of things, it looks as though a 
high percentage spinsterhood and a high employment 
rate both reduce the reproduction rate; but these two 
quantities are not independent, and their effects are 
intermingled. Suppose, for example, the employment 
rate of itself had no real effect on the reproduction 
rate; it would reflect in some degree the influence of 
the percentage spinsterhood and show an apparent 
effect, on the argument: 

high percentage spinsterhood leads to low reproduction rate, 

high employment rate is associated with high percentage spinsterhood, 

therefore, high employment rate is associated with low reproduction rate. 

This effect could explain some, at least, of the 
apparent correlation between factors (1) and (3); and, 
similarly, any causal effect the employment rate had 
could explain some of the apparent correlation between 
factors (1) and (2). The method of partial correlation 
enables us to separate out these effects. The partial 
correlation coefficient between factors (1) and (2), which 
measures the effect of percentage spinsterhood alone, 
is -0.15, expressing a very weak association, which 
is of negligible importance. That is to say, if the 
employment rate among women is kept constant, the 
percentage spinsterhood is practically unrelated to the 
reproduction rate. The partial correlation coefficient 
between factors (1) and (3) is -0.52, so that if the 
percentage spinsterhood is kept constant there is an 
appreciable tendency for a high employment rate among 
women to produce a low reproduction rate. These 
results, as far as they go, suggest that, in order to in- 
crease births, it is not much good increasing marriages, 
but the discouragement of employment among women 
might have some effect. It is necessary in giving this 
summing up of the results to emphasize the words as 
far as they go. The results only apply to such varia- 
tions as occurred between counties in 1930-32, and 
there may be other important causal factors, of which 
no account has been taken. 
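
The two partial coefficients quoted above can be recovered from the three ordinary coefficients. The sketch below assumes the usual first-order partial correlation formula, which the text does not state, and takes the three pairwise coefficients as -0.433, -0.625 and +0.530; the function and variable names are my own.

```python
# First-order partial correlation from three pairwise coefficients.
# Assumed formula (the standard one, not quoted in the text):
#   r_xy.z = (r_xy - r_xz * r_yz) / sqrt((1 - r_xz**2) * (1 - r_yz**2))
from math import sqrt

def partial_corr(r_xy: float, r_xz: float, r_yz: float) -> float:
    """Correlation of x and y with the third quantity z held constant."""
    return (r_xy - r_xz * r_yz) / sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))

# Factors: (1) reproduction rate, (2) percentage spinsterhood,
#          (3) employment rate among women.
r12 = -0.433   # reproduction rate vs. spinsterhood
r13 = -0.625   # reproduction rate vs. employment rate
r23 = +0.530   # spinsterhood vs. employment rate

# Spinsterhood alone, employment rate held constant.
r12_given_3 = partial_corr(r12, r13, r23)
# Employment rate alone, spinsterhood held constant.
r13_given_2 = partial_corr(r13, r12, r23)

print(f"r12.3 = {r12_given_3:.2f}")
print(f"r13.2 = {r13_given_2:.2f}")
```

The results agree with the figures in the text to two decimal places: the association with spinsterhood nearly vanishes once the employment rate is held constant, while the association with the employment rate is only slightly weakened.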

I have described the exhaustive investigation of all 
the circumstances surrounding the first production of 
roast pig as sound but unimaginative; progress in 
science has been much facilitated by the imaginative 
procedure of using working hypotheses in planning and 
making investigations. I cannot discuss this method 
of approach to a problem in full, but roughly it consists 
in making a tentative hypothesis based on existing 
knowledge and ideas, and testing it by arranging 
experiments that give one result if the hypothesis is 
correct, and another if one or more of a number of 
alternative hypotheses are correct. For example, the 
Chinese physicist, on being told that Bo-bo and Ho-ti 
both burnt their fingers when they first touched the 
pig, might, in a flash of genius, see that heat had some- 
thing to do with the transformation of the pig, and 
would direct his experiments to testing the hypothesis 
that fire was the only factor of importance. As ad- 
missible alternative hypotheses, he might 
possibility that the important factors were combinations 
been an apparent tendency for theoretical speculation 
to outstrip verification by observation of the real world 
(p. 167). 

The use of working hypotheses, both the main one 
and its alternatives (the alternatives are all too often neg- 
lected by the amateur), is very important in statistical 
research. They guide the statistician in planning 
his inquiry, in choosing what data to collect or use, in 
arranging and presenting them, and in deciding what 
statistical constants to calculate; and finally the results 
are examined in their light. Without such aid in 
selecting from the enormous range of possibilities, 
progress in knowledge would be slow, and much effort 
would be wasted in useless work. For example, in 
investigating the porridge question mentioned in the 
letter quoted on page 125 we might dismiss the likeli- 
hood of the condiments having any effect on health, 
and adopt the hypothesis that oatmeal and milk both 
have good effects, with the three alternatives that 
benefit is derived from (1) oatmeal alone, (2) milk 
alone, and (3) neither milk nor oatmeal. Then we 
would measure separately the health of children who 
took (a) oatmeal and milk, (b) oatmeal without milk, 
(c) milk alone, and (d) neither milk nor oatmeal; if it 
were impossible to find anyone who took oatmeal alone, 
an experiment might be necessary. If the question of 
the effect of the condiments was also included, the 
inquiry would be more complicated. In making these 
suggestions, I have neglected the important questions 
of the quantities of oatmeal and milk given to the 
children, and of the alternatives to these foods (for 
presumably children who do not eat porridge have 
something for breakfast); all these would need to be 
considered in a comprehensive study. 
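
Why four groups of children are needed can be seen by writing down what each hypothesis predicts for each group. The sketch below is a deliberately crude illustration: it assumes that 'benefit' is a simple yes-or-no matter, ignores quantities and alternative foods as the paragraph above does, and uses labels of my own.

```python
# What each working hypothesis predicts for the four groups of children.
# 'Benefit' is treated as yes/no, which is a simplifying assumption.

# Group label -> (takes oatmeal, takes milk)
groups = {
    "oatmeal and milk": (True, True),
    "oatmeal without milk": (True, False),
    "milk alone": (False, True),
    "neither": (False, False),
}

# Hypothesis label -> rule giving predicted benefit from (oatmeal, milk)
hypotheses = {
    "main: oatmeal and milk both do good": lambda oatmeal, milk: oatmeal or milk,
    "alternative 1: oatmeal alone does good": lambda oatmeal, milk: oatmeal,
    "alternative 2: milk alone does good": lambda oatmeal, milk: milk,
    "alternative 3: neither does good": lambda oatmeal, milk: False,
}

# Predicted pattern of benefit across the four groups, in a fixed order.
predictions = {
    name: tuple(rule(*diet) for diet in groups.values())
    for name, rule in hypotheses.items()
}

for name, pattern in predictions.items():
    shown = ", ".join(
        f"{group}: {'yes' if benefit else 'no'}"
        for group, benefit in zip(groups, pattern)
    )
    print(f"{name}\n    {shown}")

# The design discriminates only if no two hypotheses predict the same pattern.
distinct_patterns = len(set(predictions.values()))
print("distinct patterns:", distinct_patterns)
```

Each of the four hypotheses predicts a different pattern, so observations on all four groups can in principle decide between them; with the 'oatmeal without milk' group missing, the main hypothesis and the milk-alone alternative would predict the same thing for the three groups that remain.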
Thus, it is better to approach an inquiry in the light 
of existing knowledge, and to arrange it to answer 
certain specific questions, than to collect the data, 
subject them to a routine of statistical reduction, and 
then passively accept whatever results emerge. I do 
not deny that this passive approach may often yield 
good results, but it is inferior, except in some new 
field where there is no previous knowledge on which to 
base hypotheses. The use of statistical data to prove 
a case, in the sense of demonstrating it, is unscientific; 
but their use to prove a case in the old-fashioned sense 
of testing it is scientific and profitable. A statistical 
inquiry should be approached with a mind that is open 
but not empty. 

The success of this method of approach depends to 
some degree on the main working hypothesis being 
somewhere near the truth. A false hypothesis can do 
no permanent harm, for ultimately it will be dis- 
credited, and investigations inspired by such often lead 
to valuable discoveries. Nevertheless, a false trail may 
for a time be set and time may be wasted; and an 
investigator who was too often on the wrong trail would 
not get very far. 

There are no golden rules for the formulation of 
hypotheses, and their quality and success depend much 
on the knowledge and experience of the investigator in 
the field in which he is working, and on his intuition, 
acumen, and genius. Hypotheses may grow in the 
investigator’s mind in the course of his work, they 
may come in a mental flash, or they may be suggested 
by some external accident. Some workers find it 
helpful to write individual facts on separate cards and 
play a kind of game of ‘ patience ’ with them, sorting 
the cards into combinations suggesting a variety of 
relations between the facts. The ability to formulate 
fruitful hypotheses and design experiments to test 
them is the quality of a first-rate scientist. In addition 
to this personal quality, habits of thought and even 
prejudice have their influence on the kinds of hypothesis 
that will be entertained. For this reason, impartiality 
is essential ; and an investigator is most likely to be 
impartial if he is disinterested in the issue of the 
inquiry. The investigator should not be narrow- 
minded, and should be prepared to consider any 
reasonable alternatives to the main hypotheses he 
favours, but he cannot afford to waste his time on un- 
reasonable ones. Sidney and Beatrice Webb, who 
have much of interest and value to say on the subject 
of social investigations, have written: 

‘We have found it useful, in the early stages of an 
investigation, deliberately to “make a collection” of 
all the hypotheses we could at that stage imagine which 
seemed to have any relevance whatever to the special 
kind of social institution that we were dealing with. 
We noted them all down on our several sheets of paper, 
and others as we went along: wise suggestions and 
crazy ones, plausible theories and fantastic ones, the 
dicta of learned philosophers and those of “cranks” 
and monomaniacs, excluding those that we thought 
had no possible relevance to our work, such as the 
prophecies extracted from the measurements of the 
Great Pyramid, or those of the astrologers.’ 

This passage suggests a breadth of outlook which I 
imagine most scientists would applaud, but not so 
many show; but even the Webbs stick at taking 
astrology seriously. Yet I cannot see why they should 
exclude this and include the theories of cranks and 
monomaniacs; those who believe in astrology have a 
perfect right to say that, in this matter, the Webbs 
show prejudice. The exact position at which one 
draws the line between what is reasonable and un- 
reasonable is largely a personal matter. 

Hypotheses formulated at any one time also tend to 
follow trends or fashions. For example there are great 
similarities between the theory of natural selection and 
the laissez-faire theories of economics, and there is some 
vogue in these days for applying the dialectical process 
to the pursuit of knowledge in various fields. The 
favourite hypothesis with which statisticians usually 
first examine data is, that the observed variations and 
effects are due to random errors or to chance rather 
than to the operation of newly discovered causes. 
This can be tested by the theory of errors, and the 
statistician will almost invariably hold it as long as it 
is compatible with the data. In this way, one of the 
statistician’s chief functions is to act as a devil’s 
advocate against the admission of new knowledge. It 
has been said that ‘Bacon was eminently the philo- 
sopher of error prevented, rather than of “progress 
facilitated”.’ The same might almost be said of the 
statistician. In the fields to which statistics is mostly 
applied, the prevention of error is a most necessary 
function ; whereas there are plenty of people ready to 
facilitate progress. 

Frequently, several reasonable hypotheses are com- 
patible with the data, and then fresh data are necessary 
before any discrimination can be made between them. 
Even without such data, however, one hypothesis may 
be preferred to the others. The hypothesis already 
referred to, of chance being the cause of the effects, is 
a favourite one, and derives from the scientist’s general 
preference for the simplest explanations involving the 
introduction of the fewest new quantities or ideas. 
Which of the alternatives is the simplest, depends on 
the investigator’s idea of the general scheme of things, 
for the simplest is that which fits most easily into such 
a scheme. For example, data have been given showing 
that the severity of attack from smallpox tends to in- 
crease as the time elapsing since vaccination increases, 
and is greatest in patients who have not been vaccinated. 
To those who are not against vaccination on other 
grounds, the obvious inference is that this treatment 
is effective as a protection against bad attacks of small- 
pox. Anti-vaccinationists, on the other hand, find it 
easier to explain the above data on theories which, to 
the outsider, seem very complicated. As a statistician 
I cannot condemn those theories, although as a man 
who generally favours orthodox views in science I 
prefer the more obvious inference. 

Thus the statistical method, like scientific method in 
general, is based on certain fundamental principles, but 
it is not entirely automatic in its operation, and progress 
in knowledge depends to a considerable degree on the 
personal qualities of the investigator. He must be 
creative of ideas, yet should strike a nice balance 
between being too far-fetched and fanciful on the one 
hand, and being so conservative on the other that he 
impedes progress by his unwillingness to admit new 
knowledge and ideas. 

I have insisted that the statistical method of investi- 
gation is scientific. Its critical apparatus is sufficiently 
well developed and discriminative to prevent an undue 
proportion of false conclusions being reached as a 
result of statistical inquiries (a certain amount of risk 
must be taken if progress is to be maintained). At the 
worst the verdict may be that a particular inquiry 
teaches nothing new. But the question remains: Is 
the method powerful? Do statistical investigations 
often lead to positive conclusions? In the next two 
chapters I give an estimate of the usefulness of the 
statistical method in the various fields of application 
but here give only a brief general answer to the 
question. 

Where clear conclusions emerge obviously from a 
simple arrangement of the data the statistical method 
is useful in a positive way. When, however, the 
pattern of causes and effects is complicated, and 
elaborate statistical analysis is necessary, clear con- 
clusions are not often reached. So often, several 
hypotheses are compatible with the data, and when an 
analysis like that described on pages 134 to 136 is per- 
formed one cannot be sure that it is complete and that 
all factors have been accounted for. Consequently, 
the results of a purely statistical inquiry do not usually 
rise above a fairly low level of empiricism, and 
any scientific laws that are arrived at are largely 
justified on theoretical grounds. 

I present this view somewhat in a spirit of disillusion- 
ment. When first introduced to me, the methods of 
statistical analysis, particularly that of partial correla- 
tion, seemed to have unlimited power to penetrate the 
secrets of nature. I think, too, that this enthusiasm 
inspired the statisticians who developed the methods 
during the early years of this century and has been 
shared by many others, although I have no documen- 
tary evidence of this. Certainly, compared with such 
high hopes the achievement has been disappointing. 
Are statistical methods therefore useless and must 
we abandon them? No! They are powerful, even if 
their power is limited, and there are fields in which 
only they can advance knowledge. We must persevere 
with them in a spirit that is steadfast, if somewhat 
chastened, believing that progress will be maintained, 
if more slowly and with greater difficulty than once 
seemed likely. Moreover, statistical methods are 
proving exceedingly powerful and are achieving much 
in the statistical-experimental kinds of investigations 
described in Chapter VII (pp. 103 to 105). 

The use of statistics for discovering the forces at 
work in the social, economic, and similar spheres, 
where experiments are impossible, is a very difficult 
application of the scientific method. Many causes and 
effects are entangled so that it is hard to separate and 
relate them. Yet even the ordinary citizen needs to 
have some ability at least to distinguish what may be 
truth from what is probably falsehood, especially in a 
democracy, where he has to make up his mind on many 
difficult public questions and contribute to the growth 
of public opinion. Surely it is an important task of 
education to give the citizen this ability by teaching 
the elements of statistical reasoning. If this is done, 
people will develop not only the ability to look at 
controversial social and similar problems scientifically 
and dispassionately, but also the habit of doing so. 


CHAPTER X 


STATISTICS IN AFFAIRS 

The use of statistics in the business of running the 
country through its political, commercial, and social 
[...] those activities that determine the 
health, wealth, and happiness of mankind is the 
oldest and the most considerable use. The Ancient 
Egyptians had a centralized form of government 
administered with the aid [...] 
knowledge of the economic conditions of the [...] 
(regular returns were made of the level of the Nile 
on which the prosperity of the country so much 
[...] Book contains the 
results of a statistical survey, and there are evidences 
of statistics having been used in administration now 
and again during the subsequent centuries. [...] 
to regard it as the whole content of statistics. 

The need for statistical knowledge in running a 
concern increases as the concern becomes larger and 
more complex. One man can conduct the affairs 
of a family or a small business with few figures, but as 
the scale of the enterprise becomes larger it becomes 
less possible for one man, or even a few men, to have 
at the same time the necessary intimate knowledge of 
all the parts and the broad knowledge of the whole. 
Hence an organization is set up whereby the men in 
central positions work largely through statistical know- 
ledge of the parts under their control, knowledge that 
is statistical in the two senses of being numerical and 
summarized. As more industries have fallen under 
the control of large combines, and more of our activities 
have come under the control of the largest combine of 
all, the State, statistical knowledge has become increas- 
ingly important. Planning is the order of the day, 
and without statistics planning is inconceivable. 

In most of the applications of statistics considered 
in this chapter the most elementary statistical methods
suffice. Consequently, this aspect of the subject is
comparatively uninteresting to the mathematical statis- 
tician who specializes in the development of theory 
and technique; but it is well that it does not require 
great mathematical knowledge, at least to follow these 
parts of statistics which, after all, are of the greatest 
social importance. The economical and expeditious 
handling of masses of figures in large concerns is, 
however, a skilled job, requiring powers of organization
and a knowledge of what can be done with the
aid of the very expensive and intricate accounting and
sorting machines now available. 

First let us see how statistics are used in running
things after policy has been decided. Their most 
elementary use in administration is in the balancing of 
the activities of one part of a system against those of
another to secure that supplies equal requirements, 
and that there are no ‘ bottle-necks ’ or parts that are 
not employed to the full. 

For the national government, the necessary statistics 
range from extensive figures of the expenditure of the 
various departments of state and the yields of various 
taxes, required by the Chancellor of the Exchequer in
framing his budget, to the numbers of boots of various
sizes that will be used by the army, required by the 
stores department of the War Office in placing con- 
tracts. War much increases the need for figures of 
these kinds. The allocation of labour, shipping, food, 
raw materials and resources of all kinds, which in 
normal times is left to the free play of economic forces, 
is now (in 1942) directly controlled by Government 
officials working in the light of statistics of the resources 
available and the requirements of the various services, 
industries, districts, and so on. There is little I can say 
about this function of statistics, but a few minutes’ con- 
sideration will convince readers of its vital importance. 

One important subject is the measurement of the 
national income. That, rather than the revenue of 
the Government, determines the resources available 
for war or other national purposes. For many years,
the estimation of the national income of Great Britain 
has been left largely to the initiative of private in- 
dividuals, but since 1941 the Government, with its 
access to official records and all its resources, has taken 
a hand, and has started to publish annually estimates 
of the national income and expenditure. 

Local authorities need statistical information to 
enable them to adjust their supplies of various public 
services to the needs, both immediate and future, of
the districts they serve. When building a new housing 
estate, for example, water, gas, electricity, sewers, 
schools, transport, and so on have to be provided, in 
quantities that are sufficient but not excessive. Such
legislative measures as the raising of the school leaving 
age involve problems such as making decisions as to 
how many new schools and teachers will be required, 
and to solve these recourse must be had to census data.

The planning of production in a large factory or
combine is now a part of what is known as Scientific 
Management, and many firms now have a planning 
department to co-ordinate the activities of the other 
departments. The requirements of the sales department
with their delivery dates are translated into
orders to the various production departments, with 
intermediate delivery dates so arranged that the final 
products are delivered as required; these orders are
translated into orders for raw materials, tools, and 
labour, and the whole activity is organized and timed 
so that the work flows without interruption. More- 
over, to secure efficiency, it is necessary as far as 
possible to balance the sizes of the various departments 
so that they are large enough to meet all demands made
upon them and yet are not unnecessarily large. For
this work, statistical returns and charts are much
used. The whole subject is highly developed and involves
special knowledge and experience, although the
statistical methods used are not very elaborate.

Statistics is useful in administration in providing 
measures of performance and efficiency. Balance sheets 
and statements of accounts (not necessarily the pub- 
lished ones) have this use. Various special indexes 
are also devised, such as the ton-miles of freight carried 
per engine-hour, used by the railways, and the ratio of 
management expenses to premium income, used by 
insurance companies. In a factory, the average output 
per man-hour or per machine-hour and the percentage 
of the total materials wasted or spoilt may be useful 
quantities. With the aid of figures of these kinds, the 
operating efficiency of a concern can be compared 
from time to time or from one section to another; one 
firm can be compared with another or with the average
for all firms in the industry; and salesmen can be
compared. The data do not state the causes of
inefficiency, if any, and much less do they show what to
do to effect improvements. They are merely pointers,
showing administrators where things are going well
and where improvement may be sought; and their
value depends entirely on the use made of them.
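
As a minimal sketch of how indexes of this kind are worked out, the following Python fragment computes the output per man-hour and the percentage of material wasted for two sections of a concern. The section names, the figures, and the function name `efficiency_indexes` are all hypothetical, chosen only for illustration.

```python
# Hypothetical returns for two sections of a concern; none of these
# figures come from the text.
sections = {
    "A": {"output_units": 12_000, "man_hours": 4_000,
          "material_used": 5_000, "material_wasted": 150},
    "B": {"output_units": 9_000, "man_hours": 3_600,
          "material_used": 4_000, "material_wasted": 200},
}


def efficiency_indexes(returns):
    """Return (output per man-hour, percentage of material wasted)."""
    output_per_man_hour = returns["output_units"] / returns["man_hours"]
    waste_percent = 100 * returns["material_wasted"] / returns["material_used"]
    return output_per_man_hour, waste_percent


indexes = {name: efficiency_indexes(r) for name, r in sections.items()}
for name, (output_rate, waste) in indexes.items():
    print(f"Section {name}: {output_rate:.2f} units per man-hour, "
          f"{waste:.1f}% of material wasted")
```

The figures show only that section B produces less per man-hour and wastes more; they say nothing of the cause, which is the limitation noted above.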

A striking example of the use of statistical records
for increasing efficiency is provided by a costing
system set up by the L.M.S. Railway Company for
their locomotives. Expenditure on locomotives represented
such a large item in the expenses of the company
that it was considered worth while to keep complete
records of the coal consumption, repair times and costs,
the attention received in the running sheds, and so
on for each of over 10,000 locomotives separately. The
task was tremendous, but the results were held to
justify the considerable expense involved.

A fairly recent development of the application of
statistics is to the control of the quality of manufactured
articles. Much of modern industry is run on the lines
of mass production, and this involves making separately
the standard parts of an article and then assembling
them. If all the parts were exactly alike, they would
fit together exactly, the finished articles would be
exactly alike in character and quality, and all would be
well. But this does not happen. The raw materials
vary in quality, important processing conditions such
as atmospheric humidity and temperature vary, tools
and machines are used in various states of wear, and
the operators, being human, do not work with unvarying
precision. The consequence of all this is that the
products vary. Some components differ in size or
shape from the standard so much that they will not
fit in the final assembly, some are too low in quality to
give a satisfactory performance. Hence arise a number 
of questions : What variation in quality of raw materials 
shall be allowed before a complaint is made to the 
suppliers? Must every component be inspected or
only a sample ? And if a sample, how big should it be 
and how should it be taken? How much variation in 
quality may be regarded as normal and at what stage
does an increase in variation suggest that something 
has gone wrong? What is the relative importance of 
different factors in causing variation? 

All these questions require for their answer a 
mixture of technical and statistical knowledge. The 
role of statistics is the making of systematic records of
quality, the development of measures of variation, the
working out of the effects of changes in quality and
variation, and the application of the theory of random
sampling. These problems give scope for the most
advanced statistical methods available, although some
simple standardized methods have been worked out for 
limited application. 
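
One simple way of deciding when variation has ceased to be normal can be sketched in Python. The sketch assumes a rule that the text does not itself name: limits are set three standard deviations either side of the mean of measurements taken while the process was running normally, and any later measurement outside them is flagged. The diameters and the function name `out_of_control` are hypothetical.

```python
from statistics import mean, stdev

# Hypothetical diameters (mm) of components measured while the process
# was known to be running normally.
baseline = [10.02, 9.98, 10.01, 9.99, 10.00, 10.03, 9.97, 10.00]

centre = mean(baseline)
spread = stdev(baseline)  # sample standard deviation
lower, upper = centre - 3 * spread, centre + 3 * spread


def out_of_control(measurement):
    """True when a measurement falls outside the three-sigma limits."""
    return not (lower <= measurement <= upper)


# Hypothetical later measurements to be judged against the limits.
new_measurements = [10.01, 9.96, 10.12]
flags = [out_of_control(x) for x in new_measurements]
print(f"limits: {lower:.2f} to {upper:.2f}; flags: {flags}")
```

A rule of this kind answers only the question of when something has probably gone wrong; finding what has gone wrong remains a technical matter.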

There are published examples of this kind of use
of statistics in the electric-lamp-making industry in
Germany and England, in a variety of industries connected
with the Bell Telephone Company in the U.S.A.,
and in the textile, glass-making, and engineering
industries in England. The movement is spreading
rapidly, and it is likely that statistics will one day be
as widely used in technical control in industry as it
now is in commercial control and management.

Statistics also has a part to play in the development 
of policy. Its first part is that of calling attention to and 
describing the nature of economic and social problems. 



In the economic field: Is unemployment increasing 
or decreasing? Is it widespread or largely confined to 
certain areas? Which industries are expanding and 
which contracting? What changes are taking place
in the localization of industry?

In the social field: Is there a shortage of houses?
Do poverty and malnutrition abound? What changes
are taking place in sickness and mortality? Is the
rate of deaths due to tuberculosis higher in some 
areas or industries than in others? Are crime and 
drunkenness increasing or decreasing? 

In the field of business and manufacture: Are the 
sales of this or that article decreasing ? Are the costs 
of distribution in a certain area unduly high? In the 
factory, is there an undue loss of production because 
of machines being stopped for repairs? 

All these are questions that may be answered 
statistically, and they are usually of the kind that
must be answered before political action is even
considered.

A second function of statistics in the realm of policy
is to measure the importance of the various problems
and to place them in a proper perspective. Although 
most economic and social problems are essentially 
statistical in that they concern masses or groups of 
people, towns, businesses, and industries, the men who
have to deal with them are part of the system, and since
they have intimate contact with only a part it is not 
easy for them to see the whole. The views of the 
Member of Parliament are coloured by his knowledge 
of his constituency, which may be an agricultural or 
mining area; a business man tends to view all economic
problems from the standpoint of his particular industry;
the rent collector in a slum area sees overcrowding as
the most urgent social problem; and the comfortable
inhabitant of a prosperous town thinks there is nothing 
much wrong with the world. Some events strike our 
imaginations more vividly than others, either because 
of their nature or because of the publicity given to 
them. For example, a railway accident in which fifty 
people are killed creates a greater impression than the 
fact that in Great Britain during 1938 an average of 
over 500 people were killed in road accidents every 
month. In all these circumstances, the individual 
cannot easily take an impersonal view of things, and it
is there that statistical data help. 

It is, however, a weakness as well as a strength of 
statistics that they paint a broad, impersonal picture; 
for some of the ‘high-lights’ are lost. We have seen 
that statistical descriptions are essentially summaries 
that leave something out ; and in connexion with social 
problems, in particular, that something is often the 
human touch which fires the imagination and spurs the 
will to action. Statistics can show the prevalence of 
poverty, but they cannot help the rich man to imagine 
what the life of the poor man is like. They can 
measure many of the conditions of life that promote 
happiness or misery, but they cannot measure happi- 
ness. The situation is well stated in the following 
passage from the periodical Planning: 

‘Public opinion upon many social and economic
problems still suffers from an incapacity to grasp 
statistics, and thus fails to measure either the size of 
each problem as a whole or the relative importance of 
its different aspects. In the case of unemployment 
and man-power, however, the reverse appears to be 
true; everyone is only too ready to think in broad 
quantitative terms, even at the price of forgetting that
rows of classified statistics are no more than feeble
symbols for a multitude of men and women each of
whom individually represents a special and unique
problem, which cannot be satisfactorily treated while
it is simply lumped into some immense aggregate.’

The statistical description can be improved by
analysing the ‘immense aggregate’ into sub-aggregates,
as we have seen in the earlier chapters of this book, but
at the best it remains only a partial description. 

It is partly for this reason that the statistical investigator
and the man who develops policy should not rely
entirely on figures but should have direct contact with
the problems with which he is dealing. The following
passage by Professor A. L. Bowley suggests a realistic
approach to a social problem, with a good balance
between the use of statistics, particular description,
and direct contact:

‘If, for example, we know from the census account
that in five per cent of the houses of a town there are
more than two people to a room, if we ascertain that
the worse houses are insanitary and small, and if we
visit a few to find out the actual accommodation, the
age and sex of the inhabitants and their occupations,
we have probably all the data we need for criticizing
or suggesting a policy of reform, without measuring
the rooms or making a house to house visitation.’

I have insisted at some length on the limitations of 
statistics, hoping to forestall the criticisms of those 
who doubt the value of the subject and to temper the 
enthusiasms of those who over-estimate its power; but
even as a dynamic of economic and social reform
exact statistical description has value. The following
statement made by Sir John Orr in 1936, dry and
factual though it is, should have the effect of an electric
shock on a man with a social conscience and an
imagination:

‘The tentative conclusion reached is that a diet
completely adequate for health, according to modern
standards, is reached at an income-level above that of
50 per cent of the population.’

As another example of the power of statistical de- 
scription, we have the following statement by Beatrice 
Webb, showing what effect the publication of the
results of Charles Booth’s social survey of London had:

‘The authoritative demonstration . . . that as many
as thirty per cent of the inhabitants of the richest as
well as the largest city in the world lived actually at or
beneath the level of bare subsistence, came as a shock
to the governing class.’

Action taken as a result of an intellectual conviction
derived from hard facts is likely to be more resolute 
than action stimulated by an emotional appeal alone; 
and statistical facts are of the hardest metal. 

A third part for which statistics has been cast is that 
of acting as a guide to policy, i.e. of pointing out 
solutions to the problems described; but the subject 
does not play this part as well as it does the others. 
Occasionally a simple analysis of figures leads to a 
valuable conclusion. For example, a housing survey
of Liverpool made in the 1930s by the Merseyside Social
Survey showed that overcrowding was not entirely due
to poverty, since many a house was overcrowded even
though it was occupied by more than one wage-earner
earning good wages. This information at least suggests
that a policy of subsidizing rents would not have 
solved the problem of overcrowding in Liverpool. On 
the other hand, there are many important questions of 
public policy on which neither statistics nor any other
subject has given a clear answer. Is a policy of
embarking on public works the best way of curing un- 
employment? Is an expansionist policy the best way 
out of a trade depression? Is a protective tariff really 
good for industry as a whole? How can we reduce 
road accidents ? These are a few questions that have 
been the subject of opinion and controversy. The 
answers to them require a deep knowledge of the 
fundamental causes operating, but I doubt if statistics 
has got us very far. I discuss the place of statistics 
in the discovery of economic laws in the next chapter. 

I have dealt in a general way with the role of statistics 
in the development of policy, and am now going to 
consider some particular aspects of the question. 

The first aspect is that of economic and business 
forecasting. All policies are necessarily worked out 
for future application, and so forecasts must willy-nilly 
be made of future conditions. Local authorities when 
building schools, reservoirs, and so on must forecast
requirements for years ahead ; a manufacturer building 
a new factory will determine its size partly on his 
estimate of future demand for his products; most 
goods for consumption are ordered and made months 
before they are sold; and the insurance company 
quoting for an endowment life policy to mature twenty- 
five years hence must do so on the basis of an estimate 
of the future rates of interest and mortality experience. 

The prediction of the future from a knowledge of 
what has happened in the past involves the belief that 
things will in some way continue to be as they have 
been. Sometimes we are naive enough to think that 
the superficial appearance of things will continue to
be (it is so easy to take the short view), but economic
and business forecasting is based on a belief that the
relations between past events have been determined by 
fundamental principles that are stable and will continue 
to be so. An empirical forecast based on a superficial 
analysis of past events is not very reliable, but the more 
successful we are in discovering the fundamental
relations between events (the unchanging principles
that govern change), the more scientific and reliable
are our forecasts likely to be.

As an example, let us consider the prediction of the 
future population, which we do not expect to remain 
unaltered at its present level. To make a prediction 
we might plot a graph showing changes during the past 
few years, and might extend it forwards, continuing, 
say, the trend. In the absence of any reason for
supposing that the trend will continue, that would
provide an empirical forecast which might be roughly 
correct only for a year or two ahead. Alternatively, 
knowing the age composition of the present population,
it is possible by assuming future birth- and death-rates
to calculate the future population. Birth- and death-rates
are much more stable than the size of the population,
and their values for a few years in the future can
be predicted, not exactly, but pretty well. Moreover, 
the calculation is exact, so that the resulting forecast 
of the population is much more accurate than the 
empirical one, except for the effects of factors that
have not been brought into the calculation, such as 
migration and wars. For Great Britain, these neglected 
factors have not lately been very important, and much
useful work has been done in predicting the future 
population of the country, and its age composition. 
However, predictions of population are probably the 
most scientific of any that statisticians make. 
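
The second method of calculation can be sketched in Python. This is a minimal illustration only, assuming age groups of equal width and a single step forward of the same length; the function name `project`, the survival and fertility rates, and the population figures are all hypothetical. Migration and wars are left out, as in the calculation described above, and births are not reduced for infant mortality.

```python
def project(population, survival, fertility):
    """Advance an age-grouped population by one period.

    population[i] - people now in age group i (youngest first)
    survival[i]   - assumed fraction of group i alive one period later
    fertility[i]  - assumed births per person in group i during the period
    The oldest group is closed: its survivors are not carried further.
    """
    births = sum(p * f for p, f in zip(population, fertility))
    aged = [p * s for p, s in zip(population[:-1], survival[:-1])]
    return [births] + aged


# Hypothetical figures for four age groups, youngest first.
population = [1000.0, 900.0, 800.0, 500.0]
survival = [0.95, 0.97, 0.90, 0.0]
fertility = [0.0, 0.6, 0.4, 0.0]

after_one = project(population, survival, fertility)
print(after_one)
```

Repeating the step carries the forecast further ahead, but each repetition leans more heavily on the assumed rates.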

[...] and (3) movements of Stock Exchange prices are related
to the same financial factors as those that have an 
important influence on business. The claim that the 
systems work is rather spoilt by the qualification that 
the forecasts have not been correct in abnormal 
circumstances. 

Attempts to predict prices have also been based on 
the well-known theory that prices are dependent upon 
supply, among other things. To be able to measure 
the relation between supply and price for any com- 
modity it is necessary (1) that both quantities should, 
for some time, have changed enough, (2) that supply 
should have been an important factor in causing
changes in price compared with other factors, (3) that 
the relation between supply and price should have been 
fairly stable, and (4) [which is almost the same as (3)] 
that the conditions of supply (e.g. methods of manu- 
facture) and of demand (e.g. the tastes or habits of 
consumers) should not have changed much. Articles
like motor-cars and wireless sets obviously do not
satisfy these conditions, but a number of primary
products do to some degree, and moderately successful
formulae have been obtained for predicting the prices
for a few months ahead from a knowledge of the
existing supply (e.g. crop) of commodities like cereals,
cotton, and meat. Other factors affecting price have
also been considered in the same way.
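
A formula of this kind might, in its simplest form, be a straight line fitted by least squares to past figures of supply and price. The Python sketch below assumes that form, which the text does not specify; the crop and price figures and the function name `fit_line` are hypothetical.

```python
def fit_line(xs, ys):
    """Least-squares slope and intercept for y = intercept + slope * x."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# Hypothetical crop sizes (million tons) and the prices that followed.
supply = [10.0, 12.0, 14.0, 16.0, 18.0]
price = [50.0, 46.0, 41.0, 38.0, 35.0]

slope, intercept = fit_line(supply, price)
forecast = intercept + slope * 15.0  # price expected for a crop of 15
print(f"price = {intercept:.1f} {slope:+.1f} x supply; forecast {forecast:.1f}")
```

Such a line is trustworthy only so far as the four conditions listed above hold for the commodity in question.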

The man of affairs often needs to know how the
demand for an article is related to its price (the
elasticity of demand) before deciding on a price policy.
One British Chancellor of the Exchequer increased the
tax on sparkling wines without knowing what effect 
the increased price would have on consumption, and the 
reduction in consumption that occurred was so great
that his estimate of yield was completely falsified. In
1933 the British railways took the very bold step of
reducing most passenger fares from 1½d. to 1d. per
mile with a view to arresting the decline in revenue. 
Lacking knowledge of the elasticity of demand, that 
important decision had to be made in the hope that 
increased mileage demanded by the public would be 
more than the one-third by which fares were reduced ; 
otherwise revenue would have decreased. This hope
was, in the event, justified. The problem of estimating
the elasticity of demand is exactly similar, statistically,
to that of determining the relation between price and
supply. The same conditions are necessary for its
solution, and similar degrees of success have been 
attained for various kinds of commodities. 
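
The arithmetic behind the railways' decision can be sketched in Python, assuming that revenue is simply fare multiplied by mileage and that the fare fell by one-third. The function names and the trial figure of 1.6 are hypothetical; the elasticity is computed by the midpoint (arc) formula, which is one convention among several.

```python
def revenue_ratio(fare_ratio, mileage_ratio):
    """New revenue as a multiple of old revenue (revenue = fare x mileage)."""
    return fare_ratio * mileage_ratio


def arc_elasticity(q0, q1, p0, p1):
    """Midpoint (arc) elasticity of demand between two price-quantity points."""
    dq = (q1 - q0) / ((q1 + q0) / 2)
    dp = (p1 - p0) / ((p1 + p0) / 2)
    return dq / dp


fare_ratio = 2 / 3                     # assumed: fare reduced by one-third
break_even_mileage = 1 / fare_ratio    # mileage ratio leaving revenue unchanged

print(f"break-even mileage ratio: {break_even_mileage:.2f}")
print(f"elasticity at break-even: {arc_elasticity(1.0, 1.5, 1.5, 1.0):.2f}")
print(f"revenue ratio if mileage rises to 1.6: {revenue_ratio(fare_ratio, 1.6):.3f}")
```

On these assumptions revenue is maintained only if mileage rises to one and a half times its old level. The additional half is one-third of the new mileage, which may be the sense in which the increase is compared with the reduction in fares.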

All these problems of forecasting require for their 
solution economic and statistical analyses of the highest 
order. Economic analysis is necessary to suggest
general relations that may be sought between various 
movements, and to explain relations that have been 
discovered so that they can be formulated with precision.
The relations are essentially statistical in
character, i.e. they are correlations, and statistical
analysis is necessary to give them numerical expression. 
This analysis gives scope for the application of a wide 
range of advanced methods for separating out trends, 
and cyclical, seasonal, and random movements, and for 
disentangling the effects of the many factors involved. 
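
One of the simplest devices for separating a trend from a seasonal movement is the centred moving average, sketched below in Python. It stands in here for the more advanced methods referred to above and is not claimed to be any particular forecaster's practice. The quarterly sales series is hypothetical, built from a steady rise and a repeating seasonal pattern so that the separation can be checked.

```python
def centred_moving_average(series, period=4):
    """Centred moving average for an even period; None where undefined."""
    half = period // 2
    trend = [None] * len(series)
    for i in range(half, len(series) - half):
        window = series[i - half:i + half + 1]
        # The two end points get half weight so the average centres on i.
        total = 0.5 * window[0] + sum(window[1:-1]) + 0.5 * window[-1]
        trend[i] = total / period
    return trend


# Hypothetical quarterly sales: a rise of 1 per quarter plus a seasonal
# pattern (+3, -1, -3, +1) that repeats every year.
seasonal = [3, -1, -3, 1]
sales = [10 + t + seasonal[t % 4] for t in range(12)]

trend = centred_moving_average(sales)
residual = [s - m for s, m in zip(sales, trend) if m is not None]
print(trend)
print(residual)
```

With real figures the residual would contain cyclical and random movements as well as the seasonal one, and further analysis would be needed to part them.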

I have no direct experience of business and economic 
forecasting, but the impression gained from reading is 
that its achievements are useful but modest. The
forecasts are frequently fairly near to the actual events, 
but, frequently also, ‘abnormal’ conditions intervene 
to upset them. Business men do not rely heavily on
statistical forecasts, but find it useful to consider them
together with other information that can only be 
appraised subjectively. The word scientific is bandied 
about a good deal in connexion with business forecasts. 
Knowledge of the fundamental causes of economic 
changes is too meagre and inexact to form the basis of 
methods that a physicist or chemist, say, would regard 
as scientific, and the predictions are too inaccurate to 
be really scientific; but the work is proceeding along 
scientific lines, and as a great deal is being done we 
may expect that the methods will improve. 

Second, I wish to consider the surveys that are now
made of consumer markets and public opinion. One
of the chief functions of industry is to provide the
people with the things they need and desire, and there
is a growing tendency among commercial concerns to 
embark on ‘consumer research’ in order to discover 
what the needs and desires of the people are — an 
activity which seems to be a department of advertising. 

A little of this work takes the form of indirect 
statistical investigation to discover what factors influ- 
ence consumers’ demands. Mr. Mordecai Ezekiel in 
his book Methods of Correlation Analysis mentions some 
investigations made in the U.S.A. into the relation
between the prices received for various products and 
their qualities. For example, it was found that on the 
Boston (Massachusetts) market for asparagus, 38½ cents
extra per dozen bunches was received per extra inch
of green in the stalk, 4 cents less per dozen bunches of
given weight was received for every additional stick in 
the bunch (i.e. there was a preference for few thick 
sticks to many thin ones), and bunches with sticks of 
uniform thickness fetched higher prices than those
with variable sticks. It is stated that the results ‘have
had a marked influence on the practices by the producers
who supply the Boston market, and have led
to further experimental investigation as to how to
produce asparagus with desirable qualities.’ Statistical
investigations of these kinds are essentially applications
of the methods of correlation.

Most consumer research, however, uses only elementary
statistical methods. For example, localities may
be compared for population and the consumption per
head of (say) soap, as a preparation for a sales campaign
in ‘backward’ areas; or districts may be compared for
population and spending capacity as assessed from the
occupations followed, with a view to discovering where
it would be most profitable to ‘push’ the sales of
refrigerators, washing-machines, or wireless sets.

In this field, sample surveys are much used. Before
introducing a new brand of chocolates, one firm got a
panel of girls to try chocolates with variously flavoured
centres and included the favourite flavours in the
brand; and the preference of the public for the short-headed
tooth-brush was discovered by a special sample
inquiry that was made before introducing a new brand
of that article. One large stores has investigated the
habits of a sample of its customers to discover which
ones buy at the stores regularly, in which departments
they buy, the kinds of goods ordered by telephone and
so on, the aim being to provide data on which to base
sales and advertising policies.

Sample surveys are also being increasingly applied 
to the measurement of public opinion on political and 
similar questions. For some years American journals 
have conducted ‘straw votes’ on public questions, and 
the results of the Gallup Poll were much regarded and
quoted when the people of Great Britain were anxiously
following the development of the American attitude to 
the War during 1941. In Great Britain we have the 
corresponding polls of the British Institute of Public 
Opinion, the activities of Mass Observation, and the 
investigations of the B.B.C. Listeners’ Research 
Department and, more recently, of the Ministry of 
Information. Such surveys can be reasonably reliable
and can be conducted at a cost that is not prohibitive 
for a community. Will the development of this ability 
to test public opinion frequently and accurately have 
an effect on the theory and practice of government?
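
How reliable such a survey can be is shown roughly by the following Python sketch, which computes an approximate 95 per cent interval for a proportion. It assumes a simple random sample, which actual polls only approximate; the sample of 2,000 with 1,200 answering ‘yes’ and the function name `proportion_interval` are hypothetical.

```python
from math import sqrt


def proportion_interval(yes, n, z=1.96):
    """Approximate 95% interval for a proportion from a simple random sample."""
    p = yes / n
    margin = z * sqrt(p * (1 - p) / n)
    return p - margin, p + margin


# Hypothetical poll: 1,200 of 2,000 people questioned answer 'yes'.
low, high = proportion_interval(yes=1200, n=2000)
print(f"between {100 * low:.1f}% and {100 * high:.1f}%")
```

On these assumptions the margin is about two per cent either way. Because the margin varies inversely as the square root of the sample size, halving it requires a sample four times as large, which is why a moderate sample can serve a large community at a cost that is not prohibitive.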

Finally, I can do no more than mention insurance, 
a branch of commercial activity which, particularly in 
the life department, is only made possible by statistics. 


CHAPTER XI

STATISTICS AND OTHER SCIENCES

In this chapter I consider the relations between 
statistics and other branches of knowledge and take 
first the science of economics. There are three 
reasons why this subject, so regarded, would appear
to be closely dependent on statistics. 

First, economic laws, if they exist, refer to mass or 
group phenomena. Economic events are the result of 
actions based on the preferences, desires, and reactions 
of millions of people. Individually, people behave in
a way that is unpredictable (some would say in a way
that is indeterminate, i.e. that people have ‘free-will’),
and if there are any regularities in their behaviour they
are only shown in the behaviour of the mass, as
described in Chapter VIII.

It does not necessarily follow, of course, that because
individual people bear superficial resemblances to 
statistical individuals, the mass must show statistical 
regularities in behaviour; but such regularities do in 
fact seem to exist. The laws of supply and demand, for 
example, apply very widely; and even in time of war, 
when the enforcement of price regulations is backed 
by appeals to patriotic sentiment, ‘black markets’ 
spring into being. The belief that statistical laws do 
in fact describe human behaviour is implicit in the very 
existence of sciences like economics (and psychology), 
and in a rational approach to all business and political
problems.

This belief does not necessarily carry with it belief
in the permanence or universality of economic laws.
For example, the reactions of men to financial incentives
are conditioned by their ideology, which may
change with time and place; the ideology of a group
of Oriental mystics is very different from that of a
group of English business men. Nevertheless, we
believe, and act as though we believe, that most
economic events follow laws which are sufficiently
stable and widely applicable to be useful.

A second reason for expecting economic science to 
be dependent on statistics (using the word as referring 
to numerical data) is that the scientific way of discover- 
ing laws involves studying what actually happens, and 
unless the knowledge so gained is quantitative, i.e. 
statistical, it is ‘of a meagre and unsatisfactory kind’. 
The immense progress that has been made in the older 
and more exact sciences of physics and chemistry has
depended very much on measurement, and, by analogy,
one would expect the same condition of progress to 
apply to economics. 

Thirdly, if laws are to be inferred from numerical 
data this must be done by methods that are largely 
statistical, as opposed to experimental. Economic
experiments purely for the sake of gaining knowledge
are not allowed; and, even if they were, it would not
often be possible to isolate a few factors for investigation
as the experimentalist can in his laboratory. The
consequence is that the economist can only learn by
observing the events that happen outside his control.

Statistical data may be used in economic inquiries 
in three ways: (1) they may give the information that
suggests and leads to the formulation of theories, (2) 
they may be used for testing theories, and (3) they 
may provide measures of quantities that emerge from 
economic analysis. 

(1) Economic changes can only be described by 
statistics. Yet the analysis of statistics does not seem 
to have been very fruitful in suggesting economic 
theories. Jevons’s famous suggestion that cyclical
changes in prices are correlated with sunspots was
based largely on an examination of data, but it did not
have a very lasting effect on economic theory. On the
other hand, statistics showed the existence of the trade 
cycle, which has been the subject of so much economic 
theorizing. 

(2) Economic theories have, from time to time, been 
put to the test by reference to statistics, but the results 
have not been very impressive. The whole economic
system is so complicated that it is usually possible to
suggest a number of theories to fit a given set of
statistical facts. Consequently, when theoretical predictions
have been compared with experience and there
have been discrepancies, the tendency has been, not 
to abandon or modify the theory, but to find special 
reasons why the particular facts did not conform to 
the theory. The following quotation from some 
remarks made by Professor von Hayek illustrates the 
situation from the standpoint of one economist: 

‘He [Thomas Tooke] showed — and there can be 
little doubt about the fact — that low rates of interest
usually coincide with falling prices, and high interest 
rates with rising prices, and concluded from this that 
the Ricardian idea, that a reduction in the rate of 
interest would lead to a rise of prices and vice versa, 
was wrong. I doubt whether there is to-day a single 
economist of repute who would be willing to assert 
for this reason, with Tooke, that a low interest rate
leads to a fall in prices or the contrary. I need hardly
waste time to explain the paradox— but statistical
research has not helped us in any way to solve the
difficulty.’

Tooke was presumably mistaking a correlation
arising from movements in the trade cycle for a causal
relationship. A modern statistician could correct for 
the effect of the trade cycle and obtain a partial correla- 
tion between prices and the rate of interest; but could 
he then be reasonably sure he had a true measure of a 
causal relationship? Only if theoretical analysis sup- 
ported that view. Statistical analysis can separate the 
effects of various factors, given sufficient data, but 
usually only after the factors are stated by theory.
Thus it seems that, in existing circumstances, theory
inevitably controls the analysis of observational data
and is almost unaffected by the results. It is not
altogether unreasonable for economists to cling to their
theories in spite of discordant statistical facts which so
often are only apparently discordant.
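
The idea of correcting for the trade cycle can be sketched in a few lines of Python. This is only an illustration: the figures are invented, the names pearson and partial_corr are hypothetical helpers, and the first-order partial correlation formula used is the standard textbook one, not a procedure taken from Tooke or von Hayek.

```python
import math


def pearson(x, y):
    """Ordinary product-moment correlation coefficient of two series."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def partial_corr(x, y, z):
    """Correlation of x and y after allowing for a third series z."""
    r_xy = pearson(x, y)
    r_xz = pearson(x, z)
    r_yz = pearson(y, z)
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# Invented figures: both series rise with the cycle, but their
# year-to-year departures from it move in opposite directions.
cycle = [1, 2, 3, 4, 5, 6, 7, 8]
wobble = [0.5, -0.5, 0.5, -0.5, 0.5, -0.5, 0.5, -0.5]
prices = [c + w for c, w in zip(cycle, wobble)]
interest = [c - w for c, w in zip(cycle, wobble)]

raw = pearson(prices, interest)
adjusted = partial_corr(prices, interest, cycle)
print(f"raw correlation:     {raw:+.2f}")
print(f"partial correlation: {adjusted:+.2f}")
```

With these invented figures the raw correlation is strongly positive, yet it turns negative once the common cycle is allowed for. Whether the adjusted figure measures a causal relationship is still a question for theory, not for the arithmetic.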

In situations similar to this it is often profitable to 
take the theory for granted and to use the statistics 
to measure the importance in certain circumstances of 
the factors postulated in the theory. On this view,
one would not have regarded Tooke’s results as disproving
the Ricardian idea altogether, but rather as
showing that some other factor was having a much more
important effect in causing the particular variations
observed. The value of this viewpoint is greatest
when the theoretical effects are important, but not
all-important. 

(3) Economic theory postulates a number of quantities,
such as the elasticity of supply and demand, which
appear only as algebraic symbols, but need to be
evaluated if they are to be used. Such evaluation is a
proper function of statistical methods working on
statistical data. I have already, in the previous
chapter (p. 158), discussed this problem as far as
supply and demand are concerned, and it is only
necessary here to add that although the subject is a
difficult one, much work is being done in it, and
progress is being made. 


Altogether, the dependence of economic science on 
statistics and the connexion between the two subjects 
has not been as close as might have been expected. 
Economic theory has been developed, until recently at 
least, with scarcely any appeal to statistics for verification,
and theoretical economists and statisticians have
worked with comparatively little contact. Articles in
economic journals and economic books contain certain
algebraic formulae with unevaluated constants and
formal diagrams, but very little in the way of observational
data. Most statistical articles and books in the
economic field, on the other hand, contain masses of data 
and statistical analysis, but little economic analysis. 

Economists have been criticized for their alleged 
neglect of statistics and fact, and Professor L. Hogben 
has even likened them to medieval schoolmen, spinning 
their theories without any regard to facts. Lord 
Keynes wrote in 1933: 

‘ In economic discussions Ricardo was the abstract 
and a priori theorist, Malthus the inductive and intuitive
investigator who hated to stray too far from
what he could test by reference to the facts and his 
own intuitions. . . . 

‘One cannot rise from a perusal of this correspon- 
dence [i.e. between Malthus and Ricardo] without a 
feeling that the almost total obliteration of Malthus’s 
line of approach and the complete domination of 
Ricardo’s for a period of a hundred years has been a 
disaster to the progress of economics.’ 

The patient collection and analysis of statistical data 
requires a different kind of temperament from that 
required for the development of economic theory, and 
that may be one reason why the two subjects have not
come more closely together. However, it would be a 
mistake to suppose that economic theory is completely 
out of touch with reality. Lord Keynes, for example, 
refers to ‘the amalgam of logic and intuition and the 
wide knowledge of facts, most of which are not precise,
which is required for economic interpretation in its 
highest form’. 

The mind can deal with facts that are not precise, 
whereas the more formal methods of statistics cannot;
and the volume of unprecise facts is enormous
compared with that of precise facts. That may be one
reason why qualitative economic analysis has gone so
far in spite of its comparative neglect of statistics.
Another reason is probably that the economist is a man
studying the behaviour of men; he sees the economic
system from the inside. We may admire the high 
intellectual quality and penetrating power of qualitative 
economic analysis, and acknowledge the success it has 
achieved, and I would not care to say that economists 
have been wrong to have relied more on it than on 
deductions from statistics. 

Nevertheless, there are limits to what can be achieved 
by qualitative analysis alone. Economic laws still 
strike the worker trained in the ‘exact’ sciences as 
being very inadequate. I think that from the standpoint
of method, the analogy between economics and
(say) physics is good, and economics must develop
along the same path as physics by becoming more
quantitative, and indeed this development is taking
place. Much statistical work is being done in the
economic field: work that is more than mere collection
and presentation of data. The difficulties of statistical
investigations in the economic field are realized and
being overcome, and statistical methods are improving
in power and flexibility, and are becoming more
discriminative. The volume and cogency of statistical
data are increasing, and particularly valuable material 
is being provided by the records of events and experi- 
ments like the 1931 depression, the introduction of 
protective tariffs in Great Britain, and the New Deal 
projects in the U.S.A. 

It may one day happen that economics departments 
at universities, instead of being dominated by the 
theorists, will come under the domination of the
statistical laboratory, just as physics and chemistry
departments are dominated by the experimental
laboratory.

The connexion of statistics with biology is almost 
as close as with economics. As long as biologists were 
concerned with merely describing organisms and their 
functions, and with classifying them into types, 
statistics did not come into the picture. When measure- 
ments began to be made, and the existence of variation 
to be recognized, statistical ideas and methods became 
necessary and many modern statistical methods were 
first developed for biological applications. 

It was under the influence of Darwin’s ideas and
work that Galton started his numerical studies of biological
variation and founded the Biometric Laboratory
already referred to (p. 58). The main work of the
biometricians during the early years of this century
was the study of heredity in man, and of factors
responsible for the ‘deterioration’ (as the trend was
pessimistically characterized) of the race; but the
scope inevitably broadened to include a variety of
biological problems. This statistical approach to
biology and statistical methods are twins, born and
cradled together in the Biometric Laboratory. 

The description of populations of biological individuals
is statistical. The pages of biometric
publications abound in frequency distributions of the
characters of men (height, weight, reaction times to
sight and sound, and so on), of plants and flowers, and
of animals; and there are association and correlation
tables showing the statistical relationships between two
or more characters, such as the weight and vital
capacity of men, the numbers of pistils and stamens in
flowers, and the heights of fathers and sons. The idea
behind the phrase ‘like father like son’ is more exactly
expressed as far as height is concerned by the statement
that the correlation coefficient between the heights of
fathers and sons is about +0.5.
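
What a coefficient of +0.5 implies can be shown with a short calculation. The sketch below uses the usual regression line for predicting one character from another; the mean of 68 inches and the standard deviation of 2.5 inches are assumed round figures chosen for illustration, not data from the biometric publications, and predicted_son_height is a hypothetical name.

```python
def predicted_son_height(father, mean_father, mean_son, sd_father, sd_son, r):
    """Average height expected for sons of fathers of a given height."""
    return mean_son + r * (sd_son / sd_father) * (father - mean_father)


# Assumed figures (inches): equal means and spreads in both generations.
estimate = predicted_son_height(
    father=72.0, mean_father=68.0, mean_son=68.0,
    sd_father=2.5, sd_son=2.5, r=0.5,
)
print(estimate)  # 70.0
```

On these assumptions, fathers four inches above the average have sons who are, on the average, only two inches above it: like father like son, but only halfway.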

The methods of statistics find their use in all 
branches of biology, including its applications. Applied 
psychologists, for example, emphasize the existence of 
‘individual differences’ between people, but they are
really calling attention to variation which they treat 
on ordinary statistical lines. A good deal of applied 
psychology is concerned with developing tests for 
intelligence, for skill of various kinds, for accident 
proneness, and so on ; and the criterion of the value 
of such tests is that their results shall correspond with 
the performance of the individual in school or in some 
job. Such correspondence is never exact, however, so 
the problem of measuring its degree becomes one of
statistical correlation. In this direction, psychologists
have developed from the orthodox methods some
variants of their own, to suit their special requirements. 

Genetics is essentially a statistical subject, being
concerned with the relations between the characters of
groups of individuals in successive generations. The
earlier work of the biometricians in this connexion was
largely descriptive and empirical. This was perhaps
necessary and inevitable in the early stages of the
subject, and the knowledge gained has presumably
been of value; but the work seems to have been almost
sterile as far as the progress of genetics is concerned.
A more fruitful line of attack has been based on theories
developed from Mendel’s discoveries. This has proceeded
along statistical lines, and elaborate and highly
developed statistical and mathematical methods now
form the basis of a large and important branch of the
subject.

Statistics is of importance in medicine. Vital
statistics, epidemiology and public health, are rightly
regarded as being statistical, since they are concerned
with masses of people, and the treatment of these
subjects is, on the whole, adequate from the statistical
point of view. Statistical data and methods are also
used in research in many branches of medicine — often 
competently, sometimes not so. Generally, there is 
room for increased use of modern statistical methods
in medical research, and I think that even the general 
practitioner would be better if he was trained to be 
more ‘statistically minded’. 

In agriculture, correlation methods have been used 
to determine, from observations on farms not under
control, what factors influence such things as the
quantity and quality of crops. Such investigations
have included the measurement of the effects of rainfall,
sunshine, and temperature on the yield of various 
crops; of the relation between variations from farm to 
farm of the gain in weight of cattle and the quantity 
and quality of food; and of similar relations between 
the fertilizers used and the yield of various crops. 
This kind of investigation seems to have been more 
characteristic of American than of English agricultural
research. In England, energy has been concentrated 
more on experimental investigations. 

Quantitative biological experiments in all branches 
involve working with variable material and they provide
an enormous field for the application of the sampling
and experimental methods described in Chapter VII.
There are also many routine biological tests of counts
of bacteria in milk, of the germination of plant seeds,
counts of blood particles, and so on; and the biological
value of batches of substances like insulin is tested by 
measuring their effects on animals (e.g. rabbits) that 
differ in their individual reactions. In all these 
instances, two important questions have to be asked:
‘What is the most economical way of arranging the
tests to give an average result of required accuracy?’
and ‘How many tests must be made to attain this
accuracy?’ Statistical methods provide the answers,
and it is only in so far as the second of these questions 
has been properly answered that statements of, say, the 
vitamin content of various preparations can be relied 
upon, and substances like insulin can be reliably 
standardized. 

A great revolution took place when statistical ideas 
were imported into physics and chemistry — particu- 
larly the former. Physics had always been regarded 
as dealing with invariable constants of nature, perfectly 
determinate and measured with great precision. Very 
little room for statistics there! The conception of 
matter as an aggregate of elementary particles — atoms
— is an old one, and contained nothing statistical, since 
all the atoms were alike. When, however, the particles 
were given different characteristics the aggregate 
became a statistical population and the laws of its 
behaviour statistical laws. This happened first with 
the kinetic theory of gases, in which the molecules of 
a gas moved in different directions with different
velocities, and now applies to the modern theories of 
matter in terms of electrons and the like. Indeed, it 
seems that the only way of visualizing the electron 
nowadays is as a kind of a blur of probabilities. The 
statistical approach is not often necessary if the
physicist is interested only in the properties of matter
in the mass, i.e. the behaviour of the aggregate of ele- 
mentary particles, but it is when he attempts to relate 
such properties to observations on the elementary 
components. 

Physicists have developed their own statistical 
methods almost independently of the work of statis- 
ticians in other fields — there is little in common 
between statistical mechanics and the kinds of statistics
described in this book. It may be, however, that the
two branches of the subject will be related one day.
Certainly the ideas used are not unique to physics.

Statistical ideas have also some place in modern 
chemical theories. Chemists are attempting to explain
the behaviour of substances like cellulose, rubber, and 
proteins by postulating aggregates or chains of mole- 
cules of different lengths and weights. Changes in the 
behaviour of the substance as a result of chemical 
change are explained in terms of changes in the 
frequency distribution of chain length. 

In both physics and chemistry, measurements made 
in the laboratory are subject to experimental errors. 
Statistical methods are somewhat used for dealing with
these, although, as I have stated in an earlier chapter,
this application is limited. Physicists and chemists
increasingly have to make measurements on variable
material, however, particularly since the development
of biophysics and biochemistry and the extended use
of physics and chemistry in industry. Moreover, it 
may sometimes be necessary in technical research to 
make investigations or test laboratory conclusions in 
factories. For a variety of reasons, perfectly controlled 
experiments are not possible in a factory, but some 
degree of experimental control can often be achieved
without unduly upsetting the factory routine, and a
statistical experiment can be arranged. Also, a statis- 
tical analysis of the records of physical and chemical 
tests on output and quality, that are often kept in 
factories as a routine, may sometimes suggest the 
existence of technical effects or the causes of unwanted 
variations. All these situations open a wide field for
the application of statistical methods to sampling, 
arranging experiments, and analysing and testing the 
significance of results. 

Meteorology is usually regarded as a physical subject, 
presumably because the causes of weather changes are 
physical, but the subject also has statistical characteristics.
The meteorologist has no control over weather
variations, and can only record and reduce them in much 
the same way that we do other statistical data, using 
frequency distributions, averages, correlations, and so 
on. Many of the meteorologist’s diagrams are of a 
special character, however, since he may require to
represent at the same time, say, the wind strength and 
direction at various stations measured at various times 
of the year. Weather forecasting is based on the same
logical and mathematical principles and methods as 
business forecasting. 

In engineering experience there exists that un- 
controllable variation which always indicates a field 
for the application of statistics. The materials the 
engineer uses, both raw and manufactured, vary in 
strength, size, and quality ; loads borne by his structures 
and machines vary (e.g. wind pressures, the amount of 
traffic on a bridge, the demand for electricity); and he 
is often unable to control all the working conditions, 
such as temperature or humidity, of the processes in
his charge. Sometimes, the variation is small absolutely,
but must be considered because it is large
compared with the precision that is required. Consequently,
use must be made of frequency distributions,
averages, measures of variation, and so on. For
example, the strength of metal specimens is related to
their hardness, and since the strength test is destructive
and the hardness test is not, this relationship is of
value. It is not exact, however, but is a statistical
correlation, and should be regarded and treated as 
such. Also, sampling problems are raised in many 
engineering tests of materials and articles. 

Engineers have their own ways of taking account of 
variation, but they are not always the best ways. To 
allow for variations in materials and loads, engineers 
use a factor of safety when designing a machine or
structure, making the parts several times as strong as
they would need to be if they all had the average
strength and had always to bear the average load. 
These safety factors are empirical — they have been 
referred to as factors of ignorance — whereas it is 
theoretically possible to calculate them from the 
statistics of the variations. Such calculations have 
difficulties, but developments in this direction should 
be possible, and would almost certainly be profitable. 
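
One form such a calculation might take is sketched below. It assumes, purely for illustration, that strength and load are independent and normally distributed, which real materials and loads may not be; the figures and the name failure_probability are invented.

```python
import math
from statistics import NormalDist


def failure_probability(mean_strength, sd_strength, mean_load, sd_load):
    """Chance that the load on a part exceeds its strength."""
    margin_mean = mean_strength - mean_load
    margin_sd = math.sqrt(sd_strength ** 2 + sd_load ** 2)
    # The part fails when strength minus load falls below zero.
    return NormalDist(margin_mean, margin_sd).cdf(0.0)


# Assumed figures, in arbitrary units of stress.
p = failure_probability(mean_strength=300.0, sd_strength=20.0,
                        mean_load=200.0, sd_load=15.0)
print(f"factor of safety on the averages: {300.0 / 200.0:.1f}")
print(f"chance of failure: {p:.6f}")
```

The same factor of safety on the averages gives a very different chance of failure if the variations are larger or smaller, which is the information an empirical factor conceals.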

Another engineering way of treating variation that 
is not always adequate is by the use of tolerance limits. 
Articles delivered to a specification are not expected to 
be all exactly alike, but are accepted if, and only if, they 
are within certain tolerance limits of the specification. 
Tolerance limits are suitable in specifications for
operations like machining, where it is easy, with care, 
to keep within such limits as are technically desirable, 
or where every article in each batch is inspected and
those outside the limits can be separated and rejected.
But if the inspection is by sample, the rigid use of 
tolerance limits involves the rejection of a whole batch 
if even one article in the sample is outside them, 
although the very fact that a sample is used implies 
the possibility of accepting a batch in which some 
articles are outside the limits but none of them 
happened to come in the sample. It is illogical to be 
willing to run this risk and yet to reject another batch 
because one or two articles in its sample happen to 
come outside the limits. Such a rejection may also 
be uneconomic, or it may lead to tolerance limits that 
are too wide to be of value. For example, limits for 
the life of electric lamps would have to be set at (1)
a little above zero and (2) 3,400 hours, if the batch
represented by the sample of Table 5 (p. 41) is to be
accepted! And a batch in which all lamps had lives
between, say, 200 and 500 hours would satisfy a specification
containing such limits equally with a batch with
lives between, say, 200 and 3,400 hours. It is important
to specify not only the allowable limits of variation, but
also the proportions of articles in the different regions
between those limits. 
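
The risk that is run in sampling inspection can be put into figures. The sketch below assumes a batch large enough for each sampled article to be treated as an independent draw; the proportion of one article in twenty outside the limits and the sample of 20 are invented for illustration.

```python
def chance_of_clean_sample(fraction_outside, sample_size):
    """Chance that no article in the sample falls outside the limits."""
    return (1.0 - fraction_outside) ** sample_size


# Assumed figures: 5 per cent of the batch outside the limits, sample of 20.
p_accept = chance_of_clean_sample(fraction_outside=0.05, sample_size=20)
print(f"{p_accept:.3f}")  # 0.358
```

On these assumptions a batch with one article in twenty outside the limits would pass a rigid sample test more than a third of the time.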

The movement for the application of statistics in 
engineering is part of the general movement for apply- 
ing the subject to technical control and research in 
industry. Engineers have not generally regarded 
themselves as needing statistical methods, but in recent 
years an increasing number have realized how useful 
the methods may be, and have applied them.

Statistics finds occasional application to many 
subjects. In literature, for example, frequency distributions
of the lengths of sentences have been used
to characterize one aspect of the style of authors. A
striking and most interesting statistical investigation
in the literary sphere is a study of Shakespeare made
by the late Professor Caroline F. E. Spurgeon and
described in her book Shakespeare's Imagery. She
presents tables of the frequencies of various types of 
images used by Shakespeare in five of his plays and in 
certain writings of Bacon and other contemporaries. 
In the preface to her book, Professor Spurgeon writes:

‘Shakespeare’s images have, of course, constantly 
been picked out and drawn upon, to illustrate one 
aspect or another of the poet’s thought or mind, but 
the novelty of the procedure I am describing is that 
all his images are assembled, sorted, and examined on 
a systematic basis.’ 

She also asserts that:

‘in the case of a poet ... it is chiefly through
his images that he, to some extent unconsciously, “gives
himself away”.’

Professor Spurgeon reaches one conclusion, among
others: that there are two minds behind the works of
Shakespeare and Bacon.

Statistical methods are also used in examining the 
results of psychical experiments. Such experiments 
are usually so arranged that their results cannot be 
explained by what we regard as natural causes, but 
require a psychical explanation if they are not attributable
to chance. For example, a pack of cards may be
shuffled and then turned up one at a time by an
operator; a subject who cannot see the cards states to
what suit each one belongs; if he is right he scores
one and if not he scores nothing. The question
arises: is the subject’s score greater than can be
attributed to chance, i.e. is it greater than it would
have been had he guessed the cards? The answering
of such a question involves a regular use of standard 
statistical methods. 
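
A sketch of the calculation follows. It treats each of the 52 guesses as an independent trial with a one-in-four chance of naming the right suit, which is only an approximation if the subject keeps count of the suits already called; the score of 20 is an invented example and the function name is hypothetical.

```python
from math import comb


def chance_of_score_at_least(score, cards=52, p=0.25):
    """Chance of scoring `score` or more by pure guessing (binomial tail)."""
    return sum(
        comb(cards, k) * p ** k * (1 - p) ** (cards - k)
        for k in range(score, cards + 1)
    )


expected = 52 * 0.25
tail = chance_of_score_at_least(20)
print(f"expected score by chance: {expected:.0f}")
print(f"chance of 20 or more:     {tail:.3f}")
```

A small value for the second figure would mean that a score of 20 is not easily attributed to guessing; how small it must be before chance is rejected is a matter of judgment.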

For some subjects statistics provides ideas of basic 
importance; for some it provides methods of investiga- 
tion. In one way or the other, or in both ways, 
statistics has an impact on most other branches of 
knowledge. In this respect it is not unlike arithmetic. 
Arithmetic is so woven into the fabric of our thinking
that we use it almost subconsciously, and, after
leaving school, most of us are scarcely conscious of its 
existence as a separate department of study. On the 
other hand, most people are scarcely conscious of 
statistics except as a separate subject. I look forward
to the day when statistics will occupy a place in education
only a little way behind arithmetic; when everyone
will learn as much of the subject as is necessary for
ordinary life and for his particular vocation. Then
everyone will use statistics easily and naturally, and
such general introductory books as this will become 
obsolete. 


NOTES ON BOOKS 


Except for the first, the books in the following list 
are more for the serious student than for the general 
reader. Nevertheless, anyone interested in the various 
subjects can usually learn something from reading 
parts of the books, even if he does not apply himself to 
studying the whole. 

My Apprenticeship by Beatrice Webb gives an 
interesting first-hand account of the life and work of a 
social investigator. Methods of Social Study by Sidney 
and Beatrice Webb and The Measurement of Social 
Phenomena by A. L. Bowley are more for the serious
worker but are quite easy to read.

Management, Planning, and Control by A. G. H.
Dent shows the uses of statistics in scientific management
and gives a good bibliography. Business Forecasting
and its Practical Application by William Wallace
gives the attitude of one who has first-hand contacts
with both statistics and business.

Statistical Method in Economic and Political Science 
by P. Sargant Florence gives a very full, general 
discussion of the relations between the subjects in its
title.

Workers in the social, business, and economic fields
have available a very large number of textbooks on
statistical methods, of which Elementary Statistical
Methods by E. C. Rhodes is a good one to start with,
and Elements of Statistics by A. L. Bowley is a good
one to follow with.

Experimentalists, especially those working in the
biological sciences, will find Statistical Methods by
George W. Snedecor a good introduction as well as a
textbook. Statistical Methods for Research Workers
and Design of Experiments by R. A. Fisher are important
books, but the reader will not go far in them until he
has learnt something of statistical methods.

Medical readers will find Principles of Medical
Statistics by A. Bradford Hill a good introduction and
An Introduction to Medical Statistics by H. M. Woods
and W. T. Russell a first textbook.

Application of Statistical Methods to Industrial
Standardization and Quality Control by E. S. Pearson
and Quality Control Charts by B. P. Dudding and
W. J. Jennett deal shortly with general principles and
give simple procedures for industrial application.
Economic Control of Quality of Manufactured Products
by W. A. Shewhart is a fuller treatment of the whole
subject. As a textbook there is An Engineer's Manual
of Statistics by L. H. Simon.

The textbooks so far mentioned are for particular
applications of statistics. The classical general textbook
is An Introduction to the Theory of Statistics by
G. Udny Yule and M. G. Kendall.